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Tni£ OF THE INVENTION 

HEPATmS C VIRUS REPUCONS AND REPUCON ENHANCED CELLS 

CROSS-REFERENCE TO RELATED APPUCATIONS 
5 The present application claims priority to U.S. Serial No. 60/263.479, 

filed January 23, 2001, hereby incorporated by reference herein. 

BACKGROUND OF THE INVENTION 

The references cited in the present application are not admitted to be 

10 prior art to the claimed invention. 

It is estimated that about 3% of the world's population are infected 
with the Hepatitis C virus (HCV). (Wasley, et aL, 2000. Semifu Liver Dis. 20, 1-16.) 
Exposure to HCV results in an overt acute disease in a small percentage of cases, 
while in most instances the virus establishes a chronic infection causing liver 

IS inflammation and slowly progresses into liver failure and cirrhosis. (Iwarson, 1994. 
FEMS Microbiol. Rev. 74, 201>204.) In addition, epidemiological surveys indicate an 
important role of HCV in the pathogenesis of hepatocellular carcinoma. (Kew, 1994. 
FEMS Microbiol Rev. 7^211-220, Alter, 1995. Blood85. 1681-1695.) 

The HCV genome consists of a single strand RNA of about 9.5 kb in 

20 length, encoding a precursor polyprotein of about 3000 amino acids. (Choo, et aL, 
1989. Science 244, 362-364, Choo, et aL, 1989. Science 244, 359-362, Takamizawa, 
et al, 1991. / Virol 65. 1 105-1 1 13.) The HCV polyprotein contains the viral 
proteins in the order C-El-E2.p7-NS2-NS3-NS4A-NS4B-NS5A-NS5B. 

Individual viral proteins are produced by proteolysis of the HCV 

25 polyprotein. Host cell proteases release the putative structural proteins C, EI, E2, and 
p7, and create the N-terminus of NS2 at amino acid 810. (Mizushima, et al, 1994. J. 
Virol 68. 2731-2734, Hijikata, etaL 1993. RNA.S. USA 90, 10773-10777.) 

The non-structural proteins NS3. NS4A, NS4B, NS5A and NS5B 
presumably form the virus replication machinery and are released from the 

30 polyprotein. A zinc-dependent protease associated with NS2 and the N-terminus of 
NS3 is responsible for cleavage between NS2 and NS3. (Grakoui, et al, 1993. J. 
Virol 67, 1385-1395, Hijikata, a/., 1993.F.MA5. USA 90, 10773-10777.) A 
distinct serine protease located in the N-terminal domain of NS3 is responsible for 
proteolytic cleavages at the NS3/NS4A, NS4A/NS4B, NS4B/NS5A and NS5A/NS5B 

35 junctions. (Barthenschlager, et al, 1993. / Virol 67, 3835-3844, Grakoui, et al. 

I 
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1993. Proc. Natl Acad Set USA 90, 10583-10587, Tomei, et al, 1993. 7. Virol 67, 
4017-4026.) NS4A provides a cofactor for NS3 activity. (Failla, ^/ a/., 7. Vira/. 1994. . 
68, 3753-3760, De Francesco, et al, U.S. Patent No. 5,739,002.) NS5A is a highly 
phosphorylated protein concurring interferon resistance. (De Francesco, et al, 2000. 
5 Semin Uver Dis., 20(1), 69-83, Pawlotsky, 1999. 7. Viral Hepat. Suppl i, 47^8.) 
NS5B provides an RNA polymerase. (De Francesco, et al.. International Publication 
Number WO 96/37619, Behrens, et al., 1996. EMBO 15, 12-22. Lohmann. et al„ 
1998. Virology 249, 108-118.) 

Lohmann, et al.. Science 285, 110-1 13, 1999, illustrates the ability of a 

10 biscistronic HCV replicon to replicate in a hepatoma cell line. The biscistonic HCV 
replicon contained a neomycin cistron and an NS2-NSSB or an NS3-NS5B cistron. 
"NS2-NS5B" refcis to a NS2-NS3-NS4A-NS4B-NS5A-NS5B polyprotein. "NS3. 
NS5B" refers to a NS3-NS4A-NS4B-NS5A.NS5B polyprotein. 

Bartenschlager, European Patent Application 1 043 399, published 

15 October 11, 2000 (not admitted to be prior art to the claimed invention), describes a 
cell culture system for autonomous HCV RNA replication and protein expression. 
Replication and protein expression is indicated to occur in sufficiently large amounts 
for quantitative determination. European Patent Application 1 043 399 indicates that 
prior cell lines or primary cell cultures infected with HCV do not provide favorable 

20 circumstances for detecting HCV replication. 

SUMMARY OF THE INVENTION 

The present invention features nucleic acid containing one or more 
adaptive mutations, and HCV replicon enhanced cells. Adaptive mutations are 

25 mutations that enhance HCV replicon activity. HCV replicon enhanced cells are cells 
having an increased ability to maintain an HCV replicon. 

An HCV replicon is an RNA molecule able to autonomously replicate 
in a cultured cell and produce detectable levels of one or more HCV proteins. The 
basic subunit of an HCV replicon encodes for a HCV NS3-NS5B polyprotein along 

30 with a suitable 5' UTR-partial core (PC) region and 3' UTR. The 5' UTR-PC region 
is made up of a 5'IJTR region and about 36 nucleotides of the beginning of the core. 
Additional regions may be present including those coding for HCV proteins or 
elements such as the complete core, El, E2, p7 or NS2; and those coding for other 
types of proteins or elements such as a encephalomyocarditis virus (EMCV) internal 

35 ribosome entry site (IRES), a reporter protein or a selection protein. 
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The present application identifies different adaptive mutations that 
enhance HCV replicon activity. Enhancing replicon activity brings about at least one 
of the following: an increase in replicon maintenance in a cell, an increase in replicon 
replication, and an increase in replicon protein expression. 
S Adaptive mutations are described herein by identifying the location of 

the adaptive mutation with respect to a reference sequence present in a particular 
region. Based on the provided reference sequence, the same adaptive mutation can be 
produced in corresponding locations of equivalent regions having an amino acid 
sequence different than the reference sequence. Equivalent regions have the same 

10 function or encode for a polypeptide havirig the same function. 

Replicon enhanced cells are a preferred host for the insertion and 
expression of an HCV replicon. Replicon enhanced cells are initially produced by 
creating a cell containing a HCV replicon and then curing the cell of the replicon. 
The term ''replicon enhanced cell*' includes cells cured of HCV replicons and progeny 

IS of such cells. 

Thus, a first aspect of the present invention describes a nucleic acid 
molecule comprising at least one of the following regions: an altered NS3 encoding 
region, an altered NS5A encoding region, and an altered EMCV IRES region. The 
altered region contains one or more adaptive mutations. Reference to the presence of 

20 particular adaptive mutation(s) does not exclude other mutations or adaptive 

mutations from being present. Adaptive mutations are described with reference to 
either an encoded amino acid sequence or a nucleic acid sequence. 

A nucleic acid molecule can be single-stranded or part of a double 
strand, and can be RNA or DNA. Depending upon the structure of the nucleic acid 

25 molecule, the molecule may be used as a replicon or in the production of a replicon. 
For example, single-stranded RNA having the proper regions can be a replicon, while 
double-stranded DNA that includes the complement of a sequence coding for a 
replicon or replicon intermediate may useful in the production of the replicon or 
replicon intermediate. 

30 Preferred nucleic acid molecules are those containing region(s) from 

SEQ. ID. NOs. 1, 2, or 3, or the RNA version thereof, with one or more adaptive 
mutations. Reference to "the RNA version thereof indicates a ribose backbone and 
the presence of uracil instead of thymine. 

The presence of a region containing an adaptive mutation indicates that 

35 at least one such region is present. In different embodiments, for example, adaptive 
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mutations described herein are present at least in the NS3 region, in the NSSA region, 
in the NS3 and NSSA regions, in the EMCV IRES and NS3 regions, in the EMCV 
and NSSA regions, and in the ECMV IRES, NS3 and NSSA regions. 

Another aspect of the present invention descril)es an expression vector 
S comprising a nucleotide sequence of an HCV replicon or replicon intermediate 
coupled to an exogenous promoter. Reference to a nucleotide sequence ''coupled to 
an exogenous promoter'* indicates the presence and positioning of an RNA promoter 
such that it can mediate transcription of the nucleotide sequence and that the promoter 
is not naturally associated with the nucleotide sequence being transcribed. The 

10 expression vector can be used to produce RNA replicons. 

Another aspect of the present invention describes a recombinant 
human hepatoma cell. Reference to a recombinant cell includes an initially produced 
cell and progeny thereof. 

Another aspect of the present invention describes a method of making 

IS a HCV replicon enhanced cell. The method involves the steps of: (a) introducing and 
maintaining an HCV replicon into a cell and (b) curing the cell of the HCV replicon. 

Another aspect of the present invention describes an HCV replicon 
enhanced cell made by a process comprising the steps of: (a) introducing and 
maintaining an HCV replicon into a cell and (b) curing the cell of the HCV replicon. 

20 Another aspect of the present invention describes a method of making 

a HCV replicon enhanced cell comprising an HCV replicon. The method involves (a) 
introducing and maintaining a first HCV replicon into a cell, (b) curing the cell of the 
replicon, and (c) introducing and maintaining a second replicon into the cured cell, 
where the second replicon may be the same or different as the first replicon. 

25 Another aspect of the present invention describes an HCV replicon 

enhanced cell containing a HCV replicon made by the process involving the step of 
introducing an HCV replicon into an HCV replicon enhanced cell. The HCV replicon 
introduced into the HCV replicon enhanced cell may be the same or different than the 
HCV replicon used to produce the HCV replicon enhanced cell. In a preferred 

30 embodiment, the HCV replicon introduced into an HCV replicon enhanced cell is the 
same replicon as was used to produce the enhanced cell. 

Another aspect of the present invention describes a method of 
measuring the ability of a compound to affect HCV activity using an HCV replicon 
comprising an adaptive mutation described herein. The method involves providing a 

3S compound to a cell comprising the HCV replicon and measuring the ability of the 

4 
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compound to affect one or more leplicon activities as a measure of the effect on HCV 
activity. 

Another aspect of the present invention describes a method of 
measuring the ability of a compound to affect HCV activity using an HCV replicon 
S enhanced cell that comprises an HCV repHcon. The method involves providing a 
compound to the cell and measuring the ability of the compound to effect one or more 
replicon activities as a measure of the effect on HCV activity. 

Other features and advantages of the present invention are apparent 
from the additional descriptions provided herein including the different examples. 
10 The provided examples illustrate different components and methodology useful in 
practicing the present invention. The examples do not limit the claimed invention. 
Based on the present disclosure the skilled artisan can identify and employ other 
components and methodology useful for practicing the present invention. 

15 BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1 A-IG illustrate the nucleic acid sequence for the 
pHCVNeo.17 coding strand (SEQ. ID. NO. 3). The different regions of pHCVNeo.17 
are provided as follows: 

1-341: HCV 5' non-translated region, drives translation of the core-neo fusion protein; 
20 342- 1181: Core-neo fusion protein, selectable marker, 

1190-1800: Internal ribosome entry site of the encephalomyocarditis virus, drives 
translation of the HCV NS region; 

1801-77SS: HCV polyprotein from non-structural protein 3 to non-structural protein 
5B; 

25 1801-3696: Non-structural protein 3 (NS3X HCV NS3 protease/helicase; 
3697-38S8: Non-structural protein 4A (NS4A), NS3 protease cofactor, 
3859-4641: Non-structural protein 4B (NS4B); 
4642-5982: Non-structural protein 5A (NS5A); 

5983-7755: Non-structural protein 5B (NS5B); RNA-dependent RNA polymerase 
30 7759-7989: HCV 3' non-translated region; and 

7990-10690 plasmid sequences comprising origin of replication, beta lactamase 
coding sequence, and T7 promoter. 
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DETAILED DESCRIPTION OF THE INVENTION 

HCV leplicons and HCV leplicon enhanced cells can be used to 
produce a cell culture providing detectable levels of HCV RNA and HCV protein. 
HCV replicons and HCV replicon enhanced hosts can both be obtained by selecting 
S for the ability to maintain an HCV replicon iii a cell. As illustrated in the examples 
provided below, adaptive mutations present in HCV replicons and host cells can both 
assist replicon maintenance in a cell. 

The detectable replication and expression of HCV RNA in a cell 
culture system has a variety of different uses including being used to study HCV 
10 replication and expression, to study HCV and host cell interactions, to produce HCV 
RNA, to produce HCV proteins, and to provide a system for measuring the ability of a 
compound to modulate one or more HCV activities. 

Preferred cells for use with a HCV replicon are Huh-7 cells and Huh-7 
derived cells. **Huh-7 derived cells*' are cell produced starting with Huh-7 cells and 
IS introducing one or more phenotypic and/or genotypic modifications. 

Adaptive Mutations 
Adaptive mutations enhance the ability of an HCV replicon to be 
maintained and expressed in a host cell. Adaptive mutations can be initially selected 
20 for using a wild type HCV RNA construct or a mutated HCV replicon. Initial 

selection involves providing HCV replicons to cells and identifying clones containing 
a replicon. 

Nucleic acid sequences of identified HCV replicons can be determined 
using standard sequencing techniques. Comparing the sequence of input HCV 

25 constructs and selected constnicts provides the location of mutations. The effect of 
particular mutation(s) can be measured by, for example, producing a construct to 
contain particular mutation(s) and measuring the effect of these mutation(s). Suitable 
control constnicts for comparison purposes include wild type constructs and 
constructs previously evaluated. 

30 Adaptive mutations were predominantly found in the HCV NS3 and 

NS5A regions. With the exception of two silent mutations in NS5A and NS5B, 
consensus mutations occurring in the NS region resulted in changes to the deduced 
amino acid sequence. Noticeably, the amino acid changes occurred in residues that 
are conserved in all or a large number of natural HCV isolates. HCV sequences are 

35 well known in the art and can be found, for example, in GenBank. 
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Ads^tive mutations described herein can be identified with respect to a 
reference sequence. The reference sequence provides the location of the adaptive 
mutation in, for example, the NS3 or NSSA RNA, cDNA, or amino acid sequence. 
The remainder of the sequence encodes for a functional protein that may have the 
S same, or a different, sequence than the reference sequence. 

Preferred NS3 and NSSA adaptive mutations and examples of changes 
that can be made to produce such mutations are shown in Tables 1 and 2. The amino 
acid numbering shown in Tables 1 and 2 is with respect to SEQ. ID. NO. 1. The 
nucleotide numbering shown in Tables 1 and 2 is with respect to SEQ. ID. NO. 2. 
10 SEQ. ID. NO. 1 provides the amino acid sequence of the Conl HCV isolate 

(Accession Number AJ238799). SEQ. ID. NO. 2 provides the nucleic acid sequence 
of the Conl HCV isolate. 

TABLE 1 

15 



Preferred NS3 Adaptive Mutations 


Amino Acid 


Nucleotide 


Rlyl095ala 


G3625C 


glul202gly 


A3946G 


alal347thr 


G4380A 



TABLE 2 



Preferred NSSA Adaptive Mutations 


Amino Acid 


Nucleotide 


Lys@2039 


AAA@6458 


asn2041thr 


A6463C 


ser2173phe 


C6859T 


ser2197phe 


C6931T 


Ieu2198ser 


T6934C 


ala2199thr . 


G6936A 


ser2204arg 


C6953A(orG) 



@" refers to an addition. 
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Preferred adaptive mutations identified with respect to a reference 
sequence can be produced changing the encoding region of SEQ. ID. NO. I, or an 
equivalent ^uence, to result in the indicated change. Preferred adaptive mutations 
S provided in Tables 1 and 2 occur in amino acids conserved among different HC V 
isolates. 

Adaptive mutations have different effects. Some mutations alone, or 

in combination with other mutations, enhance HCV replicon activity. In some cases, 

two or nK>re mutations led to synergistic effects and in one case, a slightly 
10 antagonistic effect was observed. 

An adaptive mutation once identified can be introduced into a starting 

construct using standard genetic techniques. Examples of such techniques sue 

provided by Ausubel, Current Protocols in Molecular Biology, John Wiley, 1987- 

1998, and Sambrook, et al.^ Molecular Clonings A Laboratory Manual, 2"^ Edition, 
15 Cold Spring Harbor Laboratory Press, 1989. 

HCV replicons containing adaptive mutations can be built around an 

NS3 region or NS5A region containing one or more adaptive mutations described 

herein. The final replicon will contain replicon components needed for replication 

and may contain additional components. 
20 SEQ. ID. NO. 2 can be used as a reference point for different HCV 

regions as follows: 

5' UTR- nucleotides 1-341; 

Core- nucleotides 342-914; 

El- nucleotides 915-1490; 
25 E2- nucleotides 1491-2579; 

P7- nucleotides 2580-2768; 

NS2- nucleotides 2769-3419; 

NS3. nucleotides 3420-5312; 

NS4A- nucleotides 5313-5474; 
30 NS4B- nucleotides 5475-6257; 

NS5A- nucleotides 6258-7598; 

NS5B- nucleotides 7599-9371; and 

3' UTR- nucleotides 9374-9605. 

The amino acid sequences of the different structural and non-structural regions is 
35 provided by SEQ. ID. NO. 1 . 



8 



( 



wo 02/059321 PCT/EP02/00526 



Nucleic acid sequences encoding for a particular amino acid can be 
produced taking into account the degeneracy of the genetic code. The degeneracy of 
the genetic code arises because almost all amino acids are encoded for by different 
combinations of nucleotide triplets or ^codons*\ The translation of a particular codon 
5 into a particular amino acid is well known in the art (see, e.g., Lewin GENES IV, p. 
1 19, Oxford University Press, 1990). Amino acids are encoded for by RNA codons as 
follows: 

A=Ala=Alaninc: codons GCA, GCC, GCG, GCU 

C=Cys=Cysteine: codons UGC, UGU 
10 D=Asp=Aspartic acid: codons GAC, GAU 

E=Glu=<}lutamic acid: codons GAA, GAG 

F^Phe=Phenylalanine: codons UUC, UUU 

G=<jIy=Glycinc: codons GGA, GGC, GGG, GGU 

H=His=Histidine: codons CAC, CAU 
15 I=Ile=Isoleucine: codons AUA, AUG, AUU 

K=Lys=Lysine: codons AAA, AAG 

L=Ijeu=Leucine: codons UUA, UUG, CUA, CUC, CUG, CUU 

M=Met=Methionine: codon AUG 

N=Asn=Asparagine: codons AAC, AAU 
20 P=Pro=Proline: codons CCA, CCC, CCG. CCU 

Q=Gln=Glutamine: codons CAA, CAG 

R=Arg=Arginine: codons AGA, AGG, CGA. CGC, CGG, CGU 

S=Sen=Serine: codons AGC, AGU, UCA, UCC, UCG, UCU 

T=Thn='nireonine: codons ACA, ACC, ACG, ACU 
25 V=Val=Valinc: codons GUA, GUC, GUG. GUU 

W=Trp=Tryptophan: codon UGG 

Y=Tyr=Tyrosine: codons UAC, UAU. 

Constructs, including subgenomic and genomic replicons, containing 

one or more of the adaptive mutations described herein can also contain additional 
30 mutations. The additional mutations may be adaptive mutations and mutations not 

substantially inhibiting replicon activity. Mutations not substantially inhibiting 

replicon activity provide for a replicon that can be introduced into a cell and have 

detectable activity. 
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HCV Replicon 

HCV replicons include the full length HCV genome and subgenomic 
constructs. A basic HCV replicon is a subgenomic construct containing an HCV S' 
UTR- PC region, an HCV NS3-NS5B polyprotein encoding region, and a HCV 3* 
5 UTR. Other nucleic acid regions can be present such as those providing for HCV 
NS2, stnictural HCV protein(s) and non-HCV sequences. 

The HCV 5' UTR-PC region provides an internal ribosome entry site 
(IRES) for protein translation and elements needed for replication. The HCV 5*UTR- 
PC region includes naturally occurring HCV 5* UTR extending about 36 nucleotides 
10 into a HCV core encoding region, and functional derivatives thereof. The S'-UTR-PC 
region can be present in different locations such as site downstream from a sequence 
encoding a selection protein, a reporter, protein, or an HCV polyprotein. 

Functional derivatives of the S*-UTR-PC region able to initiate 
traiislation and assist replication can tie designed taking into structural requirements 
IS for HCV translation initiation. (See, for example, Honda, et al., 1996. Virology 222, 
3M2). The affect of different modifications to a 5' UTR-PC region can be 
determined using techniques that measure replicon activity. 

In addition to the HCV 5' UTR-PC region, non-HCV IRES elements 
can also be present in the replicon. The non-HCV IRES elements can be present in 
20 different locations including immediately upstream the region encoding for an HCV 
polyprotein. Examples of non-HCV IRES elements that can be used are the EMCV 
IRES, poliovirus IRES, and bovine viral diarrhea virus IRES. 

The HCV 3' UTR assists HCV replication. HCV 3' UTR includes 
naturally occurring HCV 3* UTR and functional derivatives thereof. Naturally 
25 occurring 3' UTR's include a poly U tract and an additional region of about 100 

nucleotides. (Tanaka, a/., 1996.7. Virol 70, 3307-3312, Kolykhalov, em/., 1996. 
J. Virol 70, 3363-3371.) At least in vivo, the 3' UTR appears to be essential for 
replication. (Kolykhalov, et al, 2000. J. Virol 2000 4, 2046-2051.) Examples of . 
naturally occurring 3* UTR derivatives are described by Bartenschlager International 
30 Publication Number EP 1 043 399. 

The NS3-NS5B polyprotein encoding region provides for a polyprotein 
that can be processed in a cell into different proteins. Suitable NS3-NS5B polyprotein 
sequences that may be part of a replicon include those present in different HCV 
strains and functional equivalents thereof resulting in the processing of NS3-NS5B to 
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a produce functional replication machinery. Proper processing can be measured for 
by assaying, for example, NS5B RNA dependent RNA polymerase. 

The ability of an NS5B protein to provide RNA polymerase activity 
can be measured using techniques well known in the art. (See, for example, De 
5 Franscesco, et aL, International Publication Number WO 96/37619, Behrens, et ai, 
1996. £M50 75:12-22, Lohmann, era/., 1998. Viro/ogy 249:108-118.) Preferably, 
the sequence of the active NSSB is substantially similar as that provided in SEQ. ID. 
NO. 1, or a wild type NS5B such as strains HCV-1, HCV-2, HCV-BK, HCV-J, HCV- 
N, HCV-H. A substantially similar sequence provides detectable HCV polymerase 
10 activity and contains 1, 2, 3, 4, 5, 6, 7,4, 9, 10, 1 1, 12, 13, 14, or 15 amino acid 

alterations to that present in a HCV NSSB polymerase. Preferably, no more than 1, 2, 
3, 4 or S alterations are present. 

Alterations to an amino acid sequence provide for substitution(s), 
insertion(s), deletion(s) or a combination thereof. Sites of different alterations can be 
IS designed taking into account the amino acid sequences of different NSSB polymerases 
to identify conserved and variable amino acid, and can be empirically determined. 

HCV replicons can be produced in a wide variety of different cells and 
in vitro. Suitable cells allow for the transcription of a nucleic acid encoding for an 
HCV replicon. 

20 

Additional Sequences 
An HCV replicon may contain non-HCV sequences in addition to 
HCV sequences. The additional sequences should not prevent replication and 
expression, and preferably serve a useful function. Sequences that can be used to 
2S serve a useful function include a selection sequence, a reporter sequence, transcription 
elements and translation elements. 

Selection Sequence 

A selection sequence in an HCV replicon facilitates the identification 
30 of a cell containing the replicon. Selection sequences are typically used in 

conjunction with some selective pressure that inhibits growth of cells not containing 
the selection sequence. Examples of selection sequences include sequences encoding 
for antibiotic resistance and ribozymes. 

Antibiotic resistance can be used in conjunction with an antibiotic to 
35 select for cells containing replicons. Examples of selection sequences providing for 
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antibiotic resistance are sequences encoding resistance to neomycin, hygromycin, 
puromycin, or zeocin. 

A ribozyme serving as a selection sequence can be used in conjunction 
with an inhibitory nucleic acid molecule that prevents cellular growth. The ribozyme 
S recognizes and cleaves the inhibitory nucleic acid. 

Reporter Sequence 

A reporter sequence can be used to detect replicon replication or 
protein expression. Preferred reporter proteins are enzymatic proteins whose presence 

10 can be detected by measuring product produced by the protein. Examples of reporter 
proteins include, luciferase, beta-lactamase, secretory alkaline phosphatase, beta- 
glucuronidase, green fluorescent protein and its derivatives. In addition, a reporter 
nucleic acid sequence can be used to provide a reference sequence that can be targeted 
by a complementary nucleic acid. Hybridization of the complementary nucleic acid to 

IS its target can be determined using standard techniques. 

Additional Sequence Configuration 

Additional non-HCV sequences are preferable 5' or 3' of an HCV 
replicon genome or subgenomic genome region. However, the additional sequences 
20 can be located within an HCV genome as long as the sequences do not prevent 

detectable replicon activity. If desired, additional sequences can be separated from 
the replicon by using a ribozyme recognition sequence in conjunction with a 
ribozyme. 

Additional sequences can be part of the same cistron as the HCV 
25 polyprotein or can be a separate cistron. If part of the same cistron, the selection or 
reporter sequence coding for a protein should result in a product that is either active as 
a chimeric protein or is cleaved inside a cell so it is separated from HCV protein. 

Selection and reporter sequences encoding for a protein when present 
as a separate cistron should be associated with elements needed for translation. Such 
30 elements include a 5" IRES. 

Detection Methods 
Methods for detecting replicon activity include those measuring the 
production or activity of replicon RNA and encoded for protein. Measuring includes 
35 qualitative and quantitative analysis. 
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Techniques suitable for measuring RNA production include those 
detecting the presence or activity of RNA. The presence of RNA can be detected 
using, for example, complementary hybridization probes or quantitative PCR. 
Techniques for measuring hybridization between complementary nucleic acid and 
5 quantitative PCR are well known in the art. (See for example, Ausubel, Current 

Protocols in Molecular Biology, John Wiley, 1987-1998, Sambrook, et ai. Molecular 
Cloning, A Laboratory Manual, 2^ Edition, Cold Spring Harbor Laboratory Press, 
1989, and U.S. Patent No. 5,73 1,148.) 

RNA enzymatic activity can be provided to the replicon by using a 

10 ribozyme sequence. Ribozyme activity can be measured using techniques detecting 
the ability of the ribozyme to cleave a target sequence. 

Techniques for measuring protein production include those detecting 
the presence or activity of a produced protein. The presence of a particular protein 
can be determined by, for example, immunological techniques. Protein activity can 

IS be measured based on the activity of an HCV protein or a reporter protein sequence. 

Techniques for measuring HCV protein activity vary depending upon 
the protein that is measured. Techniques for measuring the activity of different non- 
structural proteins such as NS2/3, NS3, and NS5B, are well known in the art. (See, 
for example, references provided in the Background of the Invention.) 

20 Assays measuring replicon activity also include those detecting virion 

production from a replicon that produces a virion; and those detecting a cytopathic 
effect from a replicon producing proteins exerting such an effect. Cytopathic effects 
can be detected by assays suitable to measure cell viability. 

Assays measuring replicon activity can be used to evaluate the ability 

25 of a compound to modulate HCV activities. Such assays can be carried out by 
providing one or more test compounds to a cell expressing an HCV replicon and 
measuring the effect of the compound on replicon activity. If a preparation containing 
more than one compound is found to modulate replicon activity, individual 
compounds or smaller groups of compounds can be tested to identify replicon active 

30 compounds. 

Compounds identified as inhibiting HCV activity can be used to 
produce replicon enhanced cells and may be therapeutic compounds. The ability of a 
compound to serve as a therapeutic compound can be confirmed using animal models 
such as a chimpanzee to measure efficacy and toxicity. 

35 
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Replicon Enhanced Host Cell 
RepHcon enhanced cells are initially produced by selecting for a cell 
able to maintain an HCV leplicon and then curing the cell of the replicon. Cells 
produced in this fashion were found to have an increased ability to maintain a replicon 
5 upon subsequent HCV replicon transfection. 

Initial transfection can be performed using a wild-type replicon or a 
replicon containing one or more adaptive mutations. If a wild-type replicon is 
employed, the replicon should contain a selection sequence to facilitate replicon 
maintenance. 

10 Cells can be cured of replicons using different techniques such as those 

employing replicon inhibitory agent. In addition, replication of HCV replicons is 
substantially reduced in confluent cells. Thus, it is conceivable to cure cells of 
replicons by culturing them at a high density. 

Replicon inhibitory agents inhibit replicon activity or select against a 

IS cell containing a replicon. An example of such an agent is IFN-a. Other HCV 
inhibitory compounds may also be employed. HCV inhibitor compounds are 
described, for example, in Llinas-Brunet, et al., 2000. Bioorg Med Chem. Lett. 10(20)^ 
2267-2270. 

The ability of a cured cell to be a replicon enhanced cell can be 
20 measured by introducing a replicon into the cell and determining efficiency of 
subsequent replicon maintenance and activity. 

EXAMPLES 

Examples are provided below to further illusU^te different features of 
25 the present invention. The examples also illustrate useful methodology for practicing 
the invention. These examples do not limit the claimed invention. 

Example 1: Techniques 

This example illustrates the techniques employed for producing and 
30 analyzing adapti ve mutations and replicon enhanced cells. 

Manipulation of Nucleic Acids and Construction of Recombinant Plasmids 

Manipulation of nucleic acids was done according to standard 
protocols. (Sambrook, et aL 1989. Molecular Cloning: A Laboratory Manual, 2"^ ed. 
35 Cold Spring Harbor Laboratory, Cold Spring Harbor. N Y.) Plasmid DNA was 
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prepared from ON culture in LB broth using Qiagen SOO columns according to 

manufacturer instructions. 

Plasmids containing desired mutations were constructed by restriction 

digestion using restriction sites flanking the mutations or by PGR amplification of the 
S area of interest, using synthetic oligonucleotides with the appropriate sequence. Site 

directed mutagenesis was carried out by inserting the mutations in the PGR primers. 

PGR amplification was performed using high fidelity thermostable polymerases or 

mixtures of polymerases containing a proofreading enzyme. (Barnes, et al., 1994. 

Proc, Natl. Acad ScL 91, 2216-2220.) AH plasmids were verified by restriction 
10 mapping and sequencing. 

pHGVneol7.wt contains the cDNA for an HGV bicistronic replicon 

identical to replicon l377neoWS3-37wt described by Bartenschlager (SEQ. ID. NO. 3) 

(Lohmarin.era/., 1999. Science 255.1 10-1 13, EMBL-genbank No. AJ242652). The 

plasmid comprises the following elements: 5* untranslated region of HGV comprising 
IS the HGV-IRES and part of the core (ntl-377); neomycin phosphotransferase coding 

sequence; and EMGV IRES; HGV coding sequences from NS3 to NS5B; 3* UTR of 

HGV. 

Plasmid pHGVNeolT.GAA is identical to pHGVNeo.l7, except that 
the GAG triplets (nt. 6934-6939 of pHGVNeol7 sequence) coding for the catalytic 
20 aspartates of the NS5B polymerase (amino acids 2737 and 2738 of HGV polyprotein) 
were changed into GGG, coding for alanine. 

Plasmid pHGVNeol7.m0 is identical to pHGVNeol7» except that the 
triplet AGG (nt. 5335-5337 of pHGVNeol7 sequence) coding for the serine of NS5A 
protein (amino acid 2204 of HGV polyprotein) was changed into AGA, coding for 
25 arginine. 

Plasmid pHGVNeol7.ml is identical to pHGVNeol7, except that the 
triplet AAG (nt. 4846-4848 of pHGVNeol7 sequence) coding for the asparagine of 
NS5A protein (amino acid 2041 of HGV polyprotein) was changed into ACG, coding 
for threonine. 

30 Plasmid pHGVNeol7.m2 is identical to pHGVNeol7, except that the 

triplet TGG (nt. 5242-5244 of pHGVNeol7 sequence) coding for the serine of NS5A 
protein (amino acid 2173 of HGV polyprotein) was changed into TTG, coding for 
phenylalanine. 
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Plasmid pHCVNeol7.m3 is identical to pHCVNeoi?, except that the 
triplet TCC (nt. 5314-5316 of pHCVNeol? sequence) coding for the serine of NS5A 
protein (amino acid 2197 of HCV polyprotein) was changed into TTC, coding for 
phenylalanine. 

5 Plasmid pHCVNeol7.m4 is identical to pHCVNeol7, except that the 

triplet TTG (nt. 5317-5319 of pHCVNeol7 sequence) coding for the leucine of NS5A 
protein (amino acid 2198 of HCV polyprotein) was changed into TCG, coding for 
serine. 

Plasmid pHCVNeol7.m5 is identical to pHCVNeol7, except that an 

10 extra triplet AAA coding for lysine was inserted after the triplet GTG (nt. 4840-4843 
of pHCVNeol7 sequence), coding for valine 2039 of HCV polyprotein. 

Plasmid pHCVNeol7.m6 is identical to pHCVNeol7, except that the 
triplets GAA and GCC (nt. 2329-2331 and 2764-2766 of pHCVNeol7 sequence) 
coding for the glutamic acid and the alanine of NS3 protein (amino acid 1202 and 

15 1347 of HCV polyprotein) were changed respectively into GGA and ACC, coding for 
glycine and threonine. The triplet TCC (nt. 5242-5244 of pHCVNeol7 sequence) 
coding for the serine of NS5A protein (amino acid 2173 of HCV polyprotein) was 
changed into TTC, coding for phenylalanine; an extra adenosine was inserted into the 
EMCV IRES (after the thymidine 1736 of the replicon sequence). 

20 Plasmid pHCVNeol7.m7 is identical to pHCVNcol7, except that the 

triplet AAC (nt. 4846-4848 of pHCVNeol7 sequence) coding for the asparagine of 
NS5A protein (amino acid 2041 of HCV polyprotein) was changed into ACC, coding 
for threonine; the triplet TCC (nt. 5242-5244 of pHCVNeol7 sequence) coding for 
the serine of NS5A protein (amino acid 2173 of HCV polyprotein) was changed into 

25 TTC, coding for phenylalanine. 

Plasmid pHCVNeol7.m8 is identical to pHCVNeol7, except that the 
triplet AAC (nt. 4846-4848 of pHCVNeol7 sequence) coding for the asparagine of 
NS5 A protein (amino acid 2041 of HCV polyprotein) was changed into ACC, coding 
for threonine; the triplet TCC (nt. 5314-5316 of pHCVNeol7 sequence) coding for 

30 the serine of NS5A protein (amino acid 2197 of HCV polyprotein) was changed into 
TTC. coding for phenylalanine. 

Plasmid pHCVNeol7.m9 is identical to pHCVNeol7, except that the 
triplet AAC (nt. 4846-4848 of pHCVNeol7 sequence) coding for the asparagine of 
NS5A protein (amino acid 2041 of HCV polyprotein) was changed into ACC, coding 
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for threonine; the triplet TTG (nt. 5317-5319 of pHCVNeol? sequence) coding for 
the leucine of NSSA protein (amino acid 2198 of HCV polyprotein) was changed into 
TCG, coding for serine. 

Plasmid pHCVNeol 7.mld is identical to pHCVNeolT, except that the 
5 triplet GA A (nt. 2329-233 1 of pHCVNeol? sequence) coding for the glutamic acid of 
NS3 protein (amino acid 1202 of HCV polyprotein) was changed into GGA, coding 
for glycine; an extra triplet AAA coding for lysine was inserted after the triplet GTG 
(nt. 4840-4843 of pHCVNeol? sequence), coding for valine 2039 of HCV 
polyprotein. 

10 Plasmid pHCVNeol?.ml 1 is identical to pHCVNeol?, except that the 

triplet TCC (nt. 5314-5316 of pHCVNeol? sequence) coding for the serine of NSSA 
protein (amino acid 219? of HCV polyprotein) was changed into TTC, coding for 
phenylalanine. The triplet GCC (nt 5320-5322 of pHCVNeol? sequence) coding for 
the alanine of NS5A protein (amino acid 2199 of HCV polyprotein) was changed into 

IS ACC coding for threonine. 

Plasmid pHCVNeol?.ml2 is identical to pHCVNeol?, except that the 
triplet AAC (nt. 4846-4848 of pHCVNeol? sequence) coding for the asparagine of 
NSSA protein (amino acid 2041 of HCV polyprotein) was changed into ACC, coding 
for threonine; the triplet TCC (nt. 5314-5316 of pHCVNeol? sequence) coding for 

20 the serine of NSSA protein (amino acid 219? of HCV polyprotein) was changed into 
TTC, coding for phenylalanine. The triplet GCC (nt. 5320-5322 of pHCVNeol? 
sequence) coding for the alanine of NSSA protein (amino acid 2199 of HCV 
polyprotein) was changed into ACC coding for threonine. 

Plasmid pHCVNeol?.ml3 has the same mutations as 

25 pHCVNeol?.m8, but also an extra adenosine inserted into the EMCV IRES (after the 
thymidine 1?36 of the replicon sequence). 

Plasmid pHCVNeol?.ml4 has the same mutations as 
pHCVNeol?.ml 1, but also an extra adenosine inserted into the EMCV IRES (after 
the thymidine 1?36 of the replicon sequence). 

30 Plasmid pHCVNeol?.ml5 is identical to pHCVNeol?, except that the 

triplet GCC (nt. 5320-5322 of pHCVNeol? sequence) coding for the alanine of 
NSSA protein (amino acid 2199 of HCV polyprotein) was changed into ACC coding 
for threonine. 
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Plasmid pRBSEAP.S is a pHCVNeo.17 derivative where the Neo 
coding sequence has been replaced with the sequence coding for the human placental 
alkaline phosphatase corresponding to nucleotides 90-1580 of pBC12^SV/SEAP 
plasmid. (Beiger, er a/., 19SS. Gene 66, MO.) 

5 

RNA Transfection 

Transfection was performed using Huh-7 cells. The cells were grown 
in Dulbecco's modified minimal essential medium (DMEM, Gibco, BRL) 
supplemented with 10% PCS. For routine work, cells were passed 1 to S twice a 

10 week using Ix trypsin/EDTA (Gibco, BRL). 

Plasmids were digested with the Seal endonuclease (New England 
Biolabs) and transcribed in vitro with the T7 Megascript kit (Ambion). Transcription 
mixtures were treated with DNase I (0.1 U/ml) for 30 minutes at 37^C to completely 
remove template DNA, extracted according to the procedure of CKomczynski 

IS (Chomczynski, et al. 1987. Anal Biochenu 162, 156*1S9)» and resuspended with 
RNase-free phosphate buffered saline (rfPBS. Sambrook, et aL, 1989. Molecular 
Cloning: A Laboratory Manual, 2*^ ed. Cold Spring Harbor Laboratory, Cold Spring 
Harbor, N.Y.). 

RNA transfection was performed as described by Liljestrom, et aL, 
20 1991. J. ViroL 6, 4107-41 13, with minor modifications. Subconfluent, actively 
growing cells were detached from the tissue culture container using trypsin/EDTA. 
Trypsin was neutralised by addition of 3 to 10 volumes of DMEM/10%FCS and cells 
were centrifuged for 5 minutes at 1200 rpm in a Haereus table top centrifuge at 40C. 
Cells were resuspended with ice cold rfPBS by gentie pipetting, counted with a 
25 haemocitometer, and centrifuged as above. rfPBS wash was repeated once and cells 
were resuspended at a concentration of 1-2 x 10^ cell/ml in rfPBS. Aliquots of cell 
suspension were mixed with RNA in sterile eppendorf tubes. The RNA/cell mixture 
was inmiediately transferred into the electroporation cuvette (precooled on ice) and 
pulsed twice with a gene pulser apparatus equipped with pulse controller (Biorad). 
30 Depending on the experiment, 0. 1 , 0.2 or 0.4 cm electrode gap cuvettes were used, 
and settings adjusted (Table 3). 
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TABLE 3 



Cuvette 


Volume 


Voltage 


Capacitance 


Resistance 


RNA 


gap (cm) 




(Volts) 


(uFa) 


(ohm) 


(UR) 


0.1 


70 


200 


25 


infinite 


i-lO 


0.2 


200 


400 


25 


infinite 


5-20 


0.4 


800 


800 


25 


infinite 


15-100 



After the electric shock, cells were left at room temperature for 1-10 
5 minutes (essentially the time required to electroporate all samples) and subsequently 
diluted with at least 20 volumes of DMEM/10%FCS and plated as required for the 
experiment. Survival and transfection efficiency were monitored by measuring the 
neutral red uptake of cell cultured for various days in the absence or in the presence of 
neomycin sulfate (G418). With these parameters, survival of Huh-7 cells was usually 
10 40-60% and transfection efficiency ranged between 40% and 100%. 

Sequence Analysis of Replicon RNAs 

The entire NS region was recloned from 3 different transfection 

experiments performed with HCVNco.17 RNA. RNA was extracted from selected 
15 clones either using the Qiagen RNAeasy minikit following manufacturer instructions 

or as de$crit)ed by Chomczynski, et aL, 1987. Anal. Biochem. 162, 156-159. 

Replicon RNAs (5 ^g of total cellular RNA) were retro-transcribed 

using oligonucleotide HCVG34 (5'- ACATGATCH'GCAGAGAGGCCAGT-S*; SEQ. 

ID. No. 4) and the Superscript II reverse transcriptase (Gibco, BRL) according to 
20 manufacturer instructions, and subsequently digested with 2 U/ml Ribonuclease H 

(Gibco BRL). The cDNA regions spanning from the EMCV IRES to the HCV 3' end 

were amplified by PGR using oligonucleotides HCVG39 (5'- 

GACASGCTGTGATAWATGTCTCCCCC-3'; SEQ. ID. NO. 5) and CITE3 (5'. 

TGGCTCtCCTCAAGCGTATTC -3'; SEQ. ID. NO. 6) and the LA Taq DNA 
25 polymerase (Takara LA Taq). 

Amplified cDNAs were digested with the Kpnl endonuclease (New 

England Biolabs) and the 5.8 kb fragments were gel purified and ligated to the 5.6 kb 

vector fragment (purified from plasmid pRBSEAP.5 digested with Kpnl) using T4 



19 



wo 02/059321 



PCT/EP02/00526 



DNA Hgase (New England Biolabs) according to manufacturer instructions. Ligated 
DNAs were transformed by electroporation in DHIOB or JMl 19 strains of E. colL 

In the case of NSSA region, total RNA isolated from 3 clones, (HB77, 
HB60 and HB68) was extracted and used for RT-PCR. 5|ig of total RNA plus 20 
5 pmole of AS61 oligo (5*-ACTCTCTGCAGTCAAGCGGCTCA-3\ RT antisense 
oligo; SEQ. ID. NO. 7) were heated 5 minutes at 95^*0, then DMSO (5% f.c), DTT 
(10 mM f.c), 1 mM dNTP (1 mM f.c), Ix Superscript buffer (1 x f.c), and 10 u 
Superscript (Gibco) were added to a total volume of 20 fil and incubated 3 hours at 
42*^0. 2^1 of this RT reaction were used to perform PGR with oligos S39 (5'- 

10 CAGTGGATGAACCGGCTGATA«3\ sense; SEQ. ID. NO. 8) or S41 (5*- 

GGGGCGACGGCATCATGCAAACC-3\ sense; SEQ. ID. NO. 9) and B43 (5*- 
CAGGACCTGCAGTCTGTCAAAGG-3\ antisense; SEQ. ID. NO. 10) using 
Elongase Enzyme Mix (Gibco) according the instruction provided by the 
manufacturer. The resulting PGR fragment was cloned in pCR2.1 vector using the 

IS TA Cloning kit (Invitrogen) and transformed in ToplOF' bacterial strain. 

Plasmid DNA was prepared from ON culture of the resulting 
ampicillin resistant colonies using Qiagen 500 columns according to manufacturer 
instructions. The presence of the desired DNA insert was ascertained by restriction 
digestion, and the nucleotide sequence of each plasmid was determined by automated 

20 sequencing. Nucleotide sequences and deduced amino acids sequences were aligned 
using the GCG software. 

TaqMan 

TaqMan analysis was typically performed using 10 ng of RNA in a 
25 reaction mix (TaqMan Gold RT-PCR kit, Perkin Elmer Biosystems) either with HCV 
specific oligos/probe (oligo 1: 5'-CGC}GAGAGCCATAGTGG-3'; SEQ. ID. NO. 11, 
oligo 2: 5'-AGTACCACAAGGCCnTCG-3'; SEQ. ID. NO. 12, probe: 5*- 
CTGCGGAACCGGTGAGTACAC-3'; SEQ. ID. NO. 13) or with human GAPDH 
specific oligos/probe (Pre-Developed TaqMan Assay Reagents, Endogenous Control 
30 Human GAPDH, Part Number 43 10884E, Perkin Elmer Biosystems), PGR was 

performed using a Perkin Elmer ABI PRISM 7700 under the following conditions: 30 
minutes at 48''C (the RT step). 10 minutes at 95°C and 40 cycles: 15 seconds at 95'*C 
and 1 minute at 60®C. Quantitative calculations were obtained using the Comparative 
Ct Method (described in User Bulletin #2, ABI PRISM 7700 Sequence Detection 
35 System, Applied Biosystem, Dec 1997) considering the level of GAPDH mRNA 
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constant. All calculations of HCV RNA are expressed as fold difference over a 
specific control. 

Antibodies and Immunological Techniques 
5 Mouse monoclonal antibody (anti-NS3 mablOE5/24) were produced 

by standard techniques. (Galfr6 and Milstein» 1981. Methods in Enzymology 73, 1- 
46.) Purified recombinant protein was used as an immunogen. (Gallinari, et aL, 
1999. Biochemistry 38, 5620-5632.) 

For Cell-EUSA analysis, transfected cells were monitored for 

10 expression of the NS3 protein by EUSA with the anti-NS3 mab lOES/24. Cells were 
seeded into 96 well plates at densities of 40,000» 30,000, 15,000 and 10,000 cells per 
well and fixed with ice-cold isc^ropanol at 1, 2, 3 and 4 days post-transfection, 
respectively. The cells were washed twice with PBS, blocked with 5% non-fat dry 
milk in PBS + 0.1% Triton XlOO + 0.02% SDS (PBSTS) and then incubated 

15 overnight at 4^C with 10E5/24 mab diluted 1 :2000 in Milk/PBSTS. After washing 5 
times with PBSTS, the cells were incubated for 3 hours at room temperature with 
anti-mouse IgG Fc specific alkaline phosphatase conjugated secondary antibody 
(Sigma A-7434X diluted 1:2000 in Milk/PBSTS. After washing again as above, the 
reaction was developed with p-nitrophenyl phosphate disodium substrate (Sigma 104- 

20 105) and the absorbance at 405 nm read at intervals. 

The results were normalized by staining with sulforhodamine B (SRB 
Sigma S 1402) to determine cell numbers. The alkaline phosphatase substrate was 
removed from the wells and the cells washed with PBS. The plates were then 
incubated with 0.4% SRB in 1% acetic acid for 30 minutes (200 ^1/well), rinsed 4 

25 times in 1% acetic acid, blotted dry and then 200 ^1/well of lOmM Tris pH 10.5 
added. After mixing, the absorbance at 570 nm was read. 

Neutral Red/ Crystal Violet Staining of Foci 

The survival of transfected cells in the absence or presence of 04 18 

30 was monitored by staining of foci/clones with neutral red in vivo with subsequent 
crystal violet staining. The medium was removed from the cells and replaced with 
fresh medium containing 0.0025% neutral red (Sigma N2889) and the cells incubated 
for 3 hours at 37°C. Cells were washed twice with PBS, fixed in 3,5% formaldehyde 
for 15 minutes, washed twice again in PBS and then with distilled water and the 

35 number of (live) foci counted. The cells could then be re-stained with crystal violet 
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by incubating with an 0.1% crystal violet (Sigma C077S) solution in 20% methanol 
for 20 minutes at room temperature, followed by 3 washes in 20% methanol and a 
wash with distilled water. 

5 Preparation Of Cells Cured Of Endogenous Replicon 

Replicon enhanced cells designated lOIFN and C1.60/cu were produced 
using different HCV inhibitory agents. Based on the techniques described herein 
additional replicon enhanced clones can readily be obtained. 

lOIFN was obtained by curing a Huh-7 cell of a replicon using human 

10 IFN-cc2b. Huh'7 cells containing HCV replicons (designated HBIIO, HBin4, HBm27 
and HBiniS) were cultured for 1 1 days in the presence of 100 U/ml recombinant 
human IFN-a2b (Intron-A, Schering-PIough)» and subsequently for 4 days in the 
absence of IFN-ot2b. At several time points during this period, the clones were 
analyzed for the presence of HCV proteins and RN A by Western and Northern 

IS blotting. After 7 days of incubation with IFN-a2b, HCV proteins could no longer be 
detected in any of these clones by Western blotting and similar effects were seen with 
RNA signals in Northern blots. IFN-ce2b treated cells were stored in liquid nitrogen 
until used for transfection experiments. 

C1.60/CU was obtained by curing a Huh-7 cell of a replicon using an 

20 HCV inhibitory compound. The presence of HCV RNA was determined using PCR 
(TaqMan) at 4, 9, 12 and 15 days. From day 9 the amount of HCV RNA was below 
the limit of detection. To further test the disappearance of the replicon, 4 million cells 
of cured Clone 60 cells (after the 15 days of treatment) were plated in the presence of 
1 mg/ml G-418. No viable cells were observed, confirming that absence of HCV 

25 replicons able to confer G-4 1 8 resistance. 

Example 2: Selection and Characterization of Cell Clones Containing Functional 
HCV Replicons 

Huh-7 cells (2-8x10^) were transfected by electroporation with in vitro 
30 transcribed replicon RNAs (10-20 ^g), plated at a density ranging from 2.5x10^ to 
10xlO^/cm\ and cultured in the presence of 0.8-1 mg/ml G418. The majority of 
replicon transfected cells became transiently resistant to G418 and duplicated 
normally for 7 to 12 days in the presence of the drug, while cells transfected with 
irrelevant RNAs and mock transfected cells did not survive more than 7 days (data not 
35 shown). Transient resistance to G418 was likely due to persistence of the Neo protein 
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expressed from the transfected RNA, since it was observed also with mutated 
replicons unable to replicate. Approximately 2 weeks after transfection, transient 
resistance declined, most cells died and small colonies of cells permanently resistant 
to the antibiotic became visible in samples transfected with HCVNeo.l? RNA, but 
S not in cells transfected with other replicon RNAs. 

In several experiments, the frequency of G418 resistant clones ranged 
between 10 and 100 clones per 10^ transfected cells. About 20 G418 resistant 
colonies were isolated, expanded and molecularly characterized. PCR and RT-PCR 
analysis of nucleic acids indicated that all clones contained HCV RNA but not HCV 

10 DNA, demonstrating that G418 resistance was due to the presence of functional 

replicons (data not shown). This result was confirmed by Northern blot analysis and 
metabolic labeling with 3H-uridine, which revealed the presence of both genomic and 
antigenomic HCV RNAs of the expected size (data not shown). Lastly, western blot, 
immunoprecipitation and immunofluorescence experiments showed that these clones 

IS expressed all HCV non-structural proteins as well as Neo protein (data not shown). 

Clones differed in terms of cell morphology and growth rate. Replicon 
RNA copy number (500-10000 molecules/cell) and viral protein expression also 
varied between different clones (data not shown). However, the amount of replicon 
RNA and proteins also varied with passages and with culture conditions and was 

20 higher when cells were not allowed to reach confluency, suggesting that replicons 
replicated more efficiently in dividing cells than in resting cells. Processing of the 
viral polyprotein occurred with kinetics similar to those observed in transfected cells. 

Interestingly, in all tested clones HCV replication was efficiently 
inhibited by treating the cells with IFN-a2b. The EC50 was between 1 and 10 U/ml, 

25 depending on the clone. 



Example 3: Identification of Adaptive Mutations 

The low number of G418 resistant clones derived from HCVNeo. 17 
RNA transfection suggested that replication could require mutatipn(s) capable of 
30 adapting the replicon to the host cell (adaptive mutations) and/or that only a small 
percentage of Huh-7 cells were competent for HCV replication. To verify the first 
hypothesis, mutations in replicons RNAs derived from selected cell clones were 
identified. 
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RN A sequences for different leplicons were determined using standard 
techniques. Such techniques involvied isolating RNA from several independent 
clones* reverse transcription to produce cDNA, amplifying cDNAs by PCR and 
cloning into an appropriate vector The cDNA spanning almost the entire HCV NS 
S region (126 bp at the 3* end of the EMCV IRES and 5650 bp of the HCV NS region 
(Le., the entire NS ORF and 298 nucleotides at the 3' end) from 5 clones (HBIIO, 
HBIII12, HBiniS, HB1II27, HBIVl) were recloned and sequenced. In addition, the 
NS5A coding region (nt. 4784-6162) from 3 additional clones (HB 77. HB 68 and HB 
60) were recloned and sequenced. 
10 To discriminate mutations present in the replicon RNA from those 

derived from the cloning procedure, at least 2 isolates derived from independent RT- 
PCR experiments were sequenced for each cell clone. Comparison of the nucleotide 
sequences with the parental sequence indicated that each isolate contained several 
mutations CTables 4A and 4B). 

15 

TABLE 4 A 



Cell clone 


HBm 12 


HBIII 18 


HBI 10 


HBIII 27 


isolate 


4 


29 


28 


61 


12 


43 


13 


72 




1674- 
7460 


1674- 
7460 


1674- 
7460 


1674- 
7460 


1674- 
7460 


1674- 
7460 


1674-7460 


1674-7460 


EMCV 
IRES 
126 bp 


A @ 1736 


A @ 1736 




C 1752 T 








T1678C 


NS3 
1895 bp 


G2009C 
A 2698 G 
G 2764 A 
A 3256 G 
T3273 C 


A2330G 
C2505T 
G 2764 A 
T3085C 


T2150C 
C2J96A 
T 3023 A 
T3134C 
C3267T 


T 2015 C 

A 2338 G 
C26J6T 
A2664G 
A 3148 G 
T 3286 C 
C36I5T 
C3657T 


T 1811 A 
A2330G 
T2666C 
T3395C 


A2330G 
A 2882 G 
T3673C 


G2009C 
T2015C 
C2336G 
A3130T 
A 3401 G 
A 3518 C 


G2009C 

€2052 A 
G2644A 
C 2803 A 
T 2823 A 
T 3692 C 


NS4A 
161 bp 


T3790C 




A 3847 0 


T 3827 A 


T3742C 




A 3743 G 


A 3797 G 


NS4B 
782 bp 


T3869C 
A 4107 G 
T4i85C 
A4428G 


C4283T 
C4429T 


G 4300 A 


A 4136 G 
A 4261 G 
G4309A 
A4449C 


T4290C 


A 4053 G 
A 2496 C 
T4316G 


G 3880 A 
T4200C 
A 4366 G 


C4547T 
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TABLE 4 A 



Cell clone 


HBni 12 


HBUl 18 


HBllO 


HBm27 


isolate 


4 


29 


28 


61 


12 


43 


13 


72 




1674- 


1674* 


1674- 


1674- 


1674- 


1674- 


167d.7d6A 


JO/«t*/*HHf 




7460 


7460 


7460 


7460 


7460 


7460 






NS5A 


A 4847 C 


G 4728 A 


C5243T 


€4729 A 


A4694T 


A4675G 


A 4855 G 


A 4888 G 


1340 bp 


G 5158 A 


A 4845 G 


A 5486 G 


T4993C 


AAA @ 


A 4761 G 


C5(X)6T 


C 4985 T 










4842 










G5J75C 


C5243T 


C5596T 


G5095 A 


T5237C 


AAA@ 


T5318C 


T5030A 














4842 








C5243T 


G5512T 


G5823A 


T5334C 




T5368C 


AS574G 


T5090A 




C5390T 


A 5521 G 




A 5374 T 






G5866A 


T5318C 




A 5719 G 


A5600G 




T 5379 A 








A 5328 G 






A5740C 




T5480C 








A5399G 










A 5513 G 








A 5574 G 










T5977C 










NSSB 


T6316C 


A6406G 


T6074C 


A 6150 G 


A6911 G 


A 5986 G 


G6479C 


G 6156 A 


1477 bp 


T6589C 


G 6756 A 


A 6541 G 


A6218G 




T6099C 


C6870T 


G 7434 A 




T7370C 


G6963T 


A 6732 G 


T 7352 A 




€6141 T 


A 7213 G 


T7444C 








A 7350 T 






G 6463 A 


T7448C 










A 7359 G 






C6849T 


















T 6865 C 







Clone name and isolate number are indicated in the first and second row» respectively. 
The first and the last nucleotide of the region that was recloned and sequenced are indicated in the third 
5 row. 

Nucleotide (lUB code) substitutions are indicated with the original nucleotide, its position and mutated 
nucleotide. 

Nucleotide(s) insertions are indicated with the nucleotide(s), the symbol @ and the position of the 

nucleotide preceding insertion. 
10 Numbering refers to the first nucleotide of the replicon sequence (EMBL-genbank No. AJ242652). 

The region in which mutations are located and the nucleotide length of each region are indicated in the 

left most column. 

Silent mutations arc in italic. 

Non sense mutations are underlined. 
1 S Consensus mutations are bold. 



TABLE 4B 



Cell clone 


HBIVl 


HB77 


HB68 


HB60 


isolate 


85 


93 


10 


14 


42 


1 


13 


7 




1674- 
7460 


1674- 
7460 


4784- 
6162 


4465- 
6162 


4784- 
6162 


4465- 
6162 


47844il62 


4784-6162 


EMCV 
IRES 
126 bp 




A @ 1736 














NS3 
1895 bp 


A 3403 G 


A 2572 G 
A3454G 
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TABLE 4B 



Cell clone 


HBIVl 


HE 77 


HE 68 


HE 60 


isolate 


85 


93 


10 


14 


42 


1 


13 


7 




1674- 
7460 


1674- 
7460 


4784- 
6162 


4465- 
6162 


4784- 
6162 


4465- 
6162 


4784-6162 


4784-6162 


NS4A 
161 bp 


















NS4B 
782 bp 


A4084G 


C3892T 
















NS5A 
1340 bp 


T4742C 
C5315T 
G5431T 


A4847C 
A5225G 
C5315T 
G 5320 A 
T 5356 A 
G 5523 A 
T 5888 A 


C4813T 
O5060C 
C5337A 


A4699C 
AS16iG 
C5337A 
A 5459 G 
T5977C 


T5171G 
C5298T 
C5337A 
A 5639 G 
A5969G 


T4587C 
T4972C 
A5094G 
A 5278 G 
G5320A 
C 5532 7 


A482JG 
G532DA 
A 5414 G 
T5601G 
C5808T 


C5337G 
C5551T 
G 5806 A 


T5751 C 
T5797C 


NS5B 
1477 bp 


T 6144 A 

A 6365 G 
A 6656 G 
A 6677 G 
T6SSSC 
T 6947 A 
T6997C 
G7041 T 


T6S5SC 
A7135G 
T717iC 















See Table 4A legend. 

The frequency of mutations ranged between 1 .7 x 10'^ and 4.5 x 10'^ 
S (average 3 x 10*^). The majority of mutations were nucleotide substitutions, although 
insertions of 1 or more nucleotides were also observed (Tables 4A and 4B). 

Approximately 85% of the mutations found only in 1 isolate (non- 
consensus) were randomly distributed in the lecloned fragment, and possibly include 
mis-incorporation during the PGR amplifications. Conversely, the remaining 15% of 
10 the mutations were common to 2 or more isolates derived from independent RT-PCR 
experiments (consensus mutations), and presumably reflected mutations present in the 
template RNA. 

Consensus mutations were found in all isolates and were either 
common to isolates derived from the same clone (consensus A), or to isolates derived 
15 from different clones (consensus B). Analysis of additional isolates derived from the 
same cell clones indicated that consensus A mutations were not always present in all 
isolates derived from one clone (data not shown). This observation, together with the 
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presence of consensus B mutations, suggests that, even within a single cell clone, 
leplicons exist as quasi-species of molecules with different sequences. 

At variance with non-consensus mutations, consensus mutations were 
not randomly distributed but were clustered in the regions coding for the NSSA 
S protein (frequency 1 x 10'^ and for the NS3 protein (frequency 0.S x 10'^. Only one 
consensus mutation was found in the region coding for the NSSB protein (frequency 
0.1 X 10'^ nucleotides) and none in the regions coding for NS4A and NS4B. 
Interestingly, 1 consensus mutation was observed also in the EMCV IRES. 

With the exception of 2 silent mutations found in NSSA and NSSB, 

10 consensus mutations occurring in the NS region resulted in changes in the deduced 
amino acid sequence (Tables SA and SB). Noticeably, these amino acid changes 
occurred in residues that are conserved in all or most natural HCV isolates. 
Interestingly, clones HB 77 and HB 60 displayed different nucleotide substitutions 
(CS337A and CS337G, respectively) resulting in the same amino acidic mutation (S 

15 2204 R). 



TABLE 5 A 



Cell clone 


HBIII 12 


HBIII 18 


HBI 10 


HBIII 27 


isolate 


4 


29 


28 


61 


12 


43 


13 


72 


NS3 


G109SA 
A1347T 


E1202G 
A 1347 T 






E1202G 


E1202G 


G109SA 


G 1095 A 


NS4A 


















NS4B 


















NSSA 


N2041T 
S2173F 


S2173F 


S2173F 


E2263 


K @ 2039 


K @ 2039 


L2I98S 
R2283R 


L2198S 
R2283R 


NSSB 



















See Table 4A legend. 

20 
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TABLE 5B 



Cell clone 


HBIVI 


HB77 


HB68 


HB60 


isolate 


85 


93 


10 


14 


42 


1 


13 


7 


NS3 


















NS4A 


















NS4B 


















NS5A 


S2197F 


N2041T 

S2197F 
A2199T 


S2204 
R 


S2204 
R 


S2204R 


A 2199 T 


A 2199 T 


S2204R 


NS5B 


N2710N 


N 27 JON 















See Table 4A legend. 



10 



Example 4: Functional Characterization of Consensus Mutations 

The identification of consensus mutations in recloned replicons 
indicated that replication proficiency of replicon RNAs contained in selected cell 
clones depended from the presence of such mutations. To substantiate this 
hypothesis, the effect of several consensus mutations on replication were analyzed. 

Consensus mutations found in the NS5A region were more closely 
analyzed. Consensus mutations were segregated from the non-consensus ones, and 
pHCVNeo.17 derivatives containing single or multiple consensus mutations were 
constructed (Table 6). 



TABLE 6 



Construct 



Consensus mutations 



G418cfu/10^ 
transfected 
cells 



pHCVNcol7.wt 
pHCVNeoI7.GAA 
pHCVNeon.mO 
pHCVNcol7.ml 
pHCVNeol7.m2 
pHCVNeol7.m3 
pHCVNeol7.m4 



NS3 



NS5A 



S2204R 
N2a4IT 
S2I73F 
S2I97F 
L2I98S 



EMCVIRES 



03 
0 

30-130 
0-3 

15-60 
160-500 

30-50 
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TABLE 6 



v.UnSkUllvl 


Consensus mutations 


O4to ClU/lU 

UCllldll^lvU 

cells 




NS3 


NS5A 


EMC VIRES 




pHCVNeol7.m5 




K®2039 




25-55 


pHCVNeol7.in6 


E1202G;A1347T 


S2173F 


Extra A 


13-100 


pHCVNeol7.in7 




N2041T; S2173F 




0-1 


pHCVNeol7.in8 




N2041T; S2197F 




360-500 


pHCVNcol7.m9 




N2041T;L2198S 




140-170 


pHCVNeol7.inlO 


E12Q2G 


K@2039 




1060 


pHCVNeol7.mll 




S2197F; A2199T 




900 


pHCVNeol7.ml2 




N2041T;S2197F;A2199T 




>1000 


pHCVNeoi7.ml3 




N204iT;S2197F 


Extra A 


100 


pHCVNcol7.ml4 




S2197F; A2199T 


Extra A 


>500 


pHCVNeol7.ml5 




A2199T 




300-600 



Huh-7 cells (2x10^) were transfected with 10 \ig of RNA transcribed from the indicated constructs. 
Approximately 2x10^ cells were plated in a 10 cm tissue culture dish and cultured with 1 mg/ml G418 
for 20 days. 

Colonies surviving selection were stained with crysuil violet and counted. 



RNAs transcribed in vitro from these constructs were transfected in 
Huh'7 cells and the affect on replication was estimated by counting neomycin 
resistant colonies (G418 cfu). As shown in Table 6, all but 1 construct containing 
single consensus mutations showed a significant increase on G418 cfu efficiency, thus 
10 indicating that the corresponding mutations improved replication. Noticeably, 2 
mutants containing single mutations in NSSA (m3 and mlS) were clearly more 
effective than all other single mutants. Results of mutants containing 2 or more 
mutations, indicated the presence of a synergistic effect in some combinations (m8, 
m9, mil and possibly mlO), but also a slightly antagonistic effect in 1 mutant (m7). 

15 

Example 5: Replicon Replication in the Absence of Selection 

Replication of HCV replicons in the absence of a G418 selection was 
detected using quantitative PGR (TaqMan). At 24 hours post-transfection a large 
amount of replicon RNA was detected in cells transfected with all replicons, including 
20 the GAA control replicon containing mutations in the catalytic GDD motif of the 

NS5B polymerase. This result suggested that analysis at very early time points (up to 
48 hour post-transfection) essentially measured the input RNA. Northern blot 
analysis also indicated that after 24 hours the majority of the transfected RNA was 
degraded intracellularly (data not shown). 
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Analysis at later time points showed that the amount of leplicon RNA 
was considerably reduced at 4 days and eventually became undetectable (6^8 days) in 
cells transfected with replicon HCVNeolV.wt, but was still high in cells trahsfected 
with leplicons mO, m3 and mlS (Table 7). At day six* that the amount of replicon 
5 RNA became undetectable in cells transfected with replicon HCVNeol7.wt, mO, and 
m2, but was detectable in cells transfected with replicon m3 and mlS (Table 7). 



TABLE 7 



. Name 


HuH7 


RNAequ. 


RNA equ. 




day 4 


day 6 


Wt 


1 X 


1 X 


hcvneol7.m0 


3x 


Ix 


hcvneol7.in2 


Ix 


Ix 


hcvneol7.m3 


5x 


3x 


hcvneol7.inlS 


6x 


5x 



10 

Persistence of mO, m3 and mlS replicons RNA was abolished by 
treatment with interferon-a or with an HCV inhibitory compound (data not shown). 
Moreover, RNA persistence was not observed with mutated replicons carrying the 
NSSB GAA mutation besides adaptive mutations (data not shown). Taken together, 

IS these results demonstrated that quantitative PCR could be used to monitor replication 
at early times post-transfection, and can be used to evaluate the replication proficiency 
of replicon RNAs containing mutations. 

Comparison of the results shown in Tables 6 and 7, indicated that there 
was a good correlation between the amount of replicon RNA detected by TaqMan and 

20 the G418 cfu efficiency. Nonetheless, some mutants (m2, m3) showed a pronounced 
effect on G4I8 cfu efficiency, and little if any effect on early replication as measured 
by TaqMan PCR, while other mutants (mO) showed the reverse behavior. 
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Example 6: HCV Rcplicon Enhanced Cells 

HCV replicon enhanced cells were produced by introducing an HCV 
replicon into a host» then curing the host of the replicon. Adaptive mutations (or 
combinations of them) by themselves increased up to 2 orders of magnitude the G418 
S cfu efficiency and enhanced early replication comparably. Nonetheless, even with the 
most effective mutants, only a small percentage of transfected cells (<:5 %, data not 
shown) gave rise to G418 resistant clones containing functional replicons. Tliis 
observation was attributed, at least in part to a low cloning efficiency of Huh-7 cells 
(data not shown), and only a fraction of Huh-7 cells being competent for replication. 
10 Several clones were cured of endogenous replicons by treating them 

for about 2 weeks with IFN-a or with a HCV inhibitory compound. Analysis at the 
end of the treatment showed that neither viral proteins nor replicon RN A could be 
detected. 

Cured cells (lOIFN and C1.60/cu) were transfected with mutated 
IS replicons and replication efficiency was determined by counting neomycin resistant 
clones (lOIPN) or by TaqMan (lOIFN and CL60/cu). As shown in Table 8, for all. 
tested replicons the G418 cfu efficiency in lOIFN cells was at least S fold higher than 
in parental Huh-7 cells. This increase in 04 18 cfu efficiency was particularly relevant 
for a subset of mutants (m3, m5, m8, m9, ml5). 

20 

TABLES 



Constnict 


Consensus mutations 


G418cfu/10^ 
transfected cells 




NS3 


NS5A 


EMCVIRES 




pHCVNeolT.wt 








12-56 


pHCVNcol7,GAA 








0 


pHCVNeol7.inO 




S2204R 




180- 1000 


pHCVNeol7.inl 




N2041T 




8- 13 


pHCVNcol7.m2 




S2173F 




2000 


pHCVNeol7.m3 




S2197F 




J600 - 3000 


pHCVNeol7.m4 




L2198S 




190-650 


pHCVNeoi7.m5 




K@2039 




1600 - 3000 


pHCVNeol7.m6 


E1202G;A1347T 


S2I73F 


extra A 


600 - 2000 


pHCVNeol7.m7 




N2041T; S2I73F 




170 - 800 


pHCVNeol7.m8 




N204IT;S2197F 




>4000 


pHCVNeol7.m9 




N204IT;L2198S 




1400-3000 


pHCVNcol7.mlO 


EI202G 


K@2039 




>4000 


pHCVNeol7.mLI 




S2197F;A2199T 




>4000 



31 



wo 02/059321 PCT/EP02/00526 



TABLES 



Constntct 



Consensus mutations 



G418 cfu/lO" 
transfected cells 



pHCVNeol7.ml2 
pHCVNcol7.inl3 
pHCVNcol7.fnl4 
pHCVNeol7,mlS 



NS3 NS5A EMCVIRES 

N2041T;S2197F;A2199T 

N204IT;S2197F extra A 

S2197F;A2199T extra A 

A2199T 



>4000 
>4000 
>4000 
>4000 



lOIFN celJs (2x10^ were transfected with 10 of RNA transcribed from the indicated constructs. 
Approximately 2xlO' cells were plated in a 10 cm tissue culture dish and cultured with 1 mg/ml G418 
for 20 days. 

Colonies surviving selection were stained with crystal violet and counted. 



10 



15 



Strikingly, the best mutants yielded a number of G418 resistant clones 
ranging between 20 and 80% of the cell clones which grew in the absence of G418 
(data not shown), thus indicating that the majority of lOIFN cells were competent for 
replication. This result was confirmed by TaqMan analysis (Table 9), in which the 
fold increase versus the parental Huh-7 cells was very high. The data indicates that 
replicons carrying adaptive mutations replicate vigorously in replicon enhanced cells 
such as lOIFN and C1.60/cu. 

TABLE 9 



Name 


lor 


FN 


CI.60/CU. 


RNA equ. 


RNA equ. 


RNA equ. 


RNA equ. 




Day 4 


day 6 


day 4 


Day 6 


Wt 


Ix 


Ix 


1 X 


Ix 


hcvneon.mO 


46x 


12 X 


78 X 


512 X 


hcvneol7.m2 


2x 


2x 


1 X 


2x 


hcvneol7.in3 


68 X 


49 X 


19 X 


392 X 


hcvneol7.ml5 


247 X 


80 X 


268 X 


5518 X 



Expression of viral proteins was determined in replicon enhanced cells 
20 using an ELISA assay designed to detect the NS3 protein in transfected cells plated in 
96 wells microliter plates (Cell-ELISA). As shown in Table 10, 24 hours post- 
transfection cells transfected with all tested replicons expressed low but detectable 
levels of the NS3 protein. 
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TABLE 10 





NS3 aifoitraiy units 




24h[ 


).t. 


96^ 


1 p.t. 


Name 




+ IFN 




+IFN 


Constnict 










Mock 


1 


1 


1 


1 


pHCVNeol7.wt 


3.7 


4.2 


1.2 


1.3 


pHCVNeol7.GAA 


3.1 


3.2 


1.1 


1 


pHCVNeol7.mO 


3.4 


3.2 


9.9 


0.8 


pHCVNeol7.m3 


5.7 


4.6 


4.7 


1.5 


pHCVNeol7.m8 


6.6 


5.1 


15.1 


1.4 


pHCVNeol7jnlO 


8 


5.6 


9.2 


1.8 


pHCVNeol7.inll 


8.4 


6.2 


13.6 


1.8 



lOlFN cells (2x10^ were transfected with 10 of RNA transcribed from the indicated constructs. 
Cells were plated In % wells microtiter plates as indicated in Example 1 . 



5 Where indicated (+IFN), IFN-a (100 U/ml) was added to the culture medium 4 hours post-transfection. 
At the indicated times post-transfection, cells were fixed and analyzed by Cell-EUSA. 

The early expression shown in Table 10 is likely due to translation of 

transfected RNA, since it was comparable in all repltcons (including that carrying the 

GAA mutation) and was not affected by BFN-a At 4 diays post-transfection, NS3 

10 expression persisted or increased in cells transfected with replicons carrying 

consensus mutations, but could not be detected anymore in cells transfected with wt 
and GAA replicons. In addition, NS3 expression was almost completely abolished 
when cells were cultured in the presence of BFN-cl 

Taken together, these results indicated that the level of NS3 expression 

15 reflected the replication rate. Indeed, NS3 expression level (Table 10) paralleled the 
RNA level measured by TaqMan (Table 9). The high replication proficiency of 
lOIFN cells was further confirmed by immunofluorescence experiments which 
showed that more than 50% of cells transfected with replicons m8 and ml 1 expressed 
high level of viral proteins, and that expression was almost completely abolished by 

20 DFN-CL 

Example 7: Replication of Full Length Constructs 

This example illustrates the ability of a full length HCV genome 
containing adaptive mutations described herein to replicate in a replicon enhanced 
25 host cell. The full length sequence of the HCV isolate Con-1 (EMBL-Genbank No. 
AJ238799) (plasmid pHCVRBFLwt) and 2 derivatives containing either the N2041T 
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and S2173 F mutations (plasmid pHCVRBFLmS) or the S2197F and A2199T 
mutations (plasmid pHCVRBFL^ml 1) were used as starting constructs. 

RNAs transcribed from the starting constructs were transfected in 
lOIFN cells and their replication proficiency was assessed by Cell-ELISA, 
5 immunofluorescence and TaqMan. Both constructs containing consensus mutations 
(pHCVRBFL.m8 and pHCVRBHLml 1) replicated, while no sign of replication was 
observed with the wt. construct (data not shown). 

Example 8: Replicons with Reporter Gene 

10 This example illustrates an HCV replicon containing adaptive 

mutations and a reporter gene. A pHCVNeol7.wt derivative where the Neo coding 
region was substituted with that coding for human placental secretory alkaline 
phosphatase (pRBSEAPS.wt) and a derivative also containing the N2041T and 
S2173F mutations (plasmid pRBSEAPS.m8) were constructed. RNAs transcribed 

IS from these plasmids were transfected in lOIFN cells and their replication proficiency 
was assessed by measuring secretion of alkaline phosphatase. Analysis of the kinetics 
of secretion suggested that only plasmid pRBSEAPS.mS was competent for 
replication (data not shown). 

20 Example 9: SEP. JD, Nos. 1 and 2 

SEQ. ID. NOs. 1 and 2 are provided as follows: 

SEP. ID. NO. 1 

MSTNPKPQRKTKRNTNRRI^D\aaT>^ 

25 SERSQPRGRR(2PIPKARQPEGRAWAQPGYPWPLYGNEGLGWAGWLLSPRGS 
RPSWGPTDPRRRSRNLGKVIDTLTCGFADLMGYIPLVGAPLGGAARALAHGV 
RVIJEDGVNYATGNIJHX;SFSIFLLAI1^ 
NASIVYEAADMIMHTPGCVPCVRENNSSRCWVALTFn^^ 
HVDIXVGAAAirSAMYVGDIX:GS\^VAQLFTFSPRRHETVQI^ 

30 VTGHRMAWDMMMNWSPTAALVVSQLIJin>QAVVDMVAGA^ 

YYSMVGNWAKVLIVMlXFAGVDGGTY\rrGGTMAKNTLGITSIJSPGS 
LVNTNGSWHINRTAIJNiCNDSIJSrrGFLAALFYVHKFNSSGCPERMAS 
AQGWGPITYNESHSSDQRPYCWHYAPRPCGIVPAAQVCGPVYCFTPSPVVVG 
TTDRFGVPTYSWGE^mTDVLU.NNTRPPQGNWFGCTWMNSTGF^KTCGGPP 
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CMGGIGNiain'CPTIX:FRKHPEATYTKCGSGPmTPR(X 
TVNFnFKVRMYVGGVEHRl£AACNWTRGERChaJEDRDRSEI^PUX^^ 
QVU>CSFTTlPALSTGIJfflJiQhrNr\a)VQYLYGIGSAWSFAIK^ 
ADARVCACLWMMlXlAQAEAALENLVVLNAASVAGAHGILSFLVFFCAAWY 
5 KGRLVPGAAYALYGVWPllIJLIIJ^PRAYAMDREMAASCGGAVWGULL 
TLSPHYIOJlJUUJWWLQYFnTlAEAHLQVWIPPUJVRGGRDAVI^ 
LIFTITKIIXAIlX3PIJ4VLQAGriXVPYWRAHGURACML\^ 
AmKlJWVLTGTYVYDHLTPIJU)WAHAGLRDlAVA\^PVVreDMETKVI^ 
GADTAACGDnLGLPVSARRGREIHLXJPADSl£GQGWRIJLAPITAYSQQTRGL 
10 LGCnTSLTGRDRNQVEGEVQWSTATQSFIJ^TCVNGVCWTVYHGAGSK^ 
GPKGPITQMYTNVIXJDLVGWQAPPGARSLTPCTXXjSSDLYLVTRHADVIPVR 

rrgdsrgsllsprpvsylkgssggpllcpsghavgifraavctrgvakavdfv 
pvesmettmrspvftonssppavpqtfqvahlhaptgsgkstkvpaayaaqg 
ykvo-vusipsvaatlgfgaymskahgidpnirtgwtittgapitystygkf^ 
15 dggcsggaydmcdechstdsttilgigtvldqaetagarlvvlatatppgsv 
twhpnieeva15stgeipfygkaipietikggrhufchskkkcdelaaklsglg 
u^avayyrgujvsviprsgdvivvatdalmtgfrgdfdsvidcntcvtqtvd 

FSLDPlFllb"lllVPQDAVSRSQRRGRTGRGRMGIYRFVTPGERPSGMFDSSVL 
CECYDAGCAWYELTPAETSVRUIAYLNTPGLPVCQDHLEFWESVFTGLTHID 

20 AHFL^QTKQAGDNFPYLVAYQATVCARAQAPPPSWDQMWKCLIRLKPTLHG 
PTPIXYRLGAVQNEVTTTHPrrKYIMACMSADLEVVrSTWVLVGGVIJWUJ^A 
YCLTTGSVVIVGRIILSGKPAIIPDREVLYREFDEMEECASHLPYIEQGMQLAEQ 
HCQKAIGUXJTATKQAEAAAPVVESKWTLEAFWAKHMWNHSGIQYLAGLS 
TLPGNPAIASlMAFTAS^rePLTTQHTLLF^^IJGGWVAAQlJ^PPSAASAFVGA 

25 lAGAAVGSIGLGKVLVDILAGYGAGVAGALVAFKVMSGEMPSTEDLVNLLPA 
ILSPGALVYGVVCAAILRRHVGPGEGAVQWMNRLIAFASRGNHVSPTHYVPE 
SDAAARVnjn^SLmQLLIOUJIQWlNEDCSTPCSGSWUUJVWDWICn^ 
FKTWlXJSKLLPRUKJVPI^CQRGYKGVWRGIXJlMtFTCPCCAQrrc 
SMRIVGPRTCSNTWHGTFPINAYTTGPCTPSPAPNYSRALWRVAAEEYVEVT 

30 RVGDFHYVTGMTTDhTVKCPCQVPAPEFFTEVDGVRLHRYAPACKPLLREEV 
TFLVGLNQYLVGSQLPCEPEPDVAVLTSMLTDPSHITAETAKRRLARGSPPSL 
ASSSASQLSAPSLKATCTTRHDSPDADLIEANLLWRQEMGGNITRVESENKVV 
ILDSFEPLQAEEDEREVSVPAEILRRSRKFPRAMPIWARPDYNPPLLESWKDPD 
YVPPVVHGCPLPPAKAPPIPPPRRKRTVVLSESTVSSALAELATKTFGSSESSA 

35 VDSGTATASPDQPSDDGDAGSDVESYSSMPPLEGEPGDPDLSDGSWSTVSEE 
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ASEDVVCCSMSYTWTGAIJTPCAAEETKIJ>INA1^NS1^^ 
SLJRQKKVTFDRLQVUJDHYRDVUCE^ 
ARSKTOYGAKDVRhO^iSKAVNHIRSVWKDlJ^T^ 
PEKGGRKPARLTV^DLGVRVCEKMALYDVVSTli^ 
5 VEFLVNAWKAKKCPMGFAYDTRCTDSTVTENDmVEK 
IRSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGhrn. 
AKLQDCimVCGDDLVVICESAGTQEDEASUlAFIEAMTO 
YDLEUTSCSSNVSVAHDASGKRVYYLTRDPTTPljyi^ 
GNIIMYAFTLWARMIUvlTHFFSnJ^ 
10 RUIGI^AFSUISYSIHSEINRVASCIJIKL^ 

AATCGKYIJWAVRTKIJa.TPIPAASQIJ)IJSSWFVAGYS^ 
RWFMWCLLLLSVGVGIYLLPNR 

SEP. ID. NO. 2: 

IS gccagcccccgattgggggcgacactccaccatagatcactcccctgtgaggaactactgtcttcacgc^^ 

gccatggcgttagtatgagtgtcgtgcagcctccaggaccccccctcccgggagagccatagtggtctgcggaacc^^ 
agtacaccggaattgccaggacgaccgggtcctttcttggatcaacccgctcaatgcctggagatttgggcgtgcccc^^ 
agactgctagccgagtagtgttgggtcgcgaaaggccttgtggtactgcctgatagggtgcttgcgagtgccccgggaggt 
ctcgtagaccgtgcaccatgagcacgaatcctaaacctcaaagaaaaaccaaacgtaacaccaaccgccgcccacagga 

20 cgtcaagttcccgggcggtggtcagatcgtcggtggagtttacctgttgccgcgcaggggccccaggttgggtgtg 

gactaggaagacttccgagcggtcgcaacctcgtggaaggcgacaacctatccccaaggctcgccagcccgagggtagg 
gcctgggctcagcccgggtacccctggcccctctatggcaatgagggcttggggtgggcaggatggctcctgtcaccccgt 
ggctctcggcctagttggggccccacggacccccggcgtaggtcgcgcaaUtgggtaagg^ 
ggcttcgccgatctcatggggtacattccgctcgtcggcgcccccctagggggcgctgccagggcc^^ 

25 (xgggttctggaggacggcgtgaactatgcaacagggaatctgcccggttgctcctmctatcttcc 
gtttgaccatcccagcttccgcttatgaagtgcgcaacgtatccggagtgtaccatgtca^ 
cattgtgtatgaggcagcggacatgatcatgcatacccccgggtgcgtgoxtgcgttcgggagaac^^ 
tgggtagcgctcactcccacgctcgcggccaggaacgctagcgtccccactacgacgatacgacgccatgto 
gttggggcggctgctctctgctccgctatgtacgtgggagatctctgcggatctgttttcctcgtcgcccagctgt^ 

30 gcctcgccggcacgagacagtacaggactgcaattgctcaatatatcccggccacgtgacaggtcaccgtatggcttggga 
tatgatgatgaactggtcacctacagcagccctaglggtatcgcagttactccggatcccacaagctgtcgtggatatggtgg 
cgggggcccattggggagtcctagcgggccttgcctactattccatggtggggaactgggctaaggltctgattgtgatgcta 
ctctttgccggcgttgacgggggaacctatgtgacaggggggacgatggccaaaaacaccctcgggattacgtccctctttt 
cacccgggtcatcccagaaaatccagcttgtaaacaccaacggcagctggcacatcaacaggactgccctgaactgcaat 

35 gaclccctcaacactgggticcttgctgcgctgttctacgtgcacaagttcaactcatctggatgcccagagcgcatggccag 
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ctgcagccccatcgacgcgttcgctcaggggtgggggcccatcacttacaatgagtcacaca 
ttgttggcactacgcaccccggccgtgcggtatcgtacccgcggcgcaggtgtgtggtcca 
cctgtcgtggtggggacgaccgaccggttcggcgtccctacgtacagttggggggagaatgag^^ 
aacaacacgcggccgccgcaaggcaactggtttggctgtacatggatgaatagcactgggttcaccaagacgt^^ 
S ccccc^gtgtaacatcggggggatcggcaataaaaccttgacctgccccacggactgcttccggaagcaccc^ 
cttacaccaagtgtggttcggggccUggttgacacccagatgcttggtccactacccatacaggctU 
actgtcaacUtaccatcttcaaggttaggatgtacgtggggggagtggagcacaggctcgaagccgcatgc^^ 
gaggagagcgttgtaacctggaggacagggacagatcagagcttagc(xgctgctgctgtctacaacggagtggca^ 
UgccctgttccUcaccaccctaccggctctgtccactggmgatccatctccatcagaacgtcgtggacgta^ 

10 acggtatagggtcggcggttgtctcctUgcaatcaaatgggaglatgtcctgUgctcUccttcttctggc 
ctgtgcctgcngtggatgatgctgctgatagctcaagctgaggccgccctagagaaa:tggtggtcctc^ 
gtggccggggcgcatggcattctctccttcctcgtgttcttctgtgctgcctggtacatcaag^ 
ggcatatgccctctacggcgtatggccgctactcctgctcctgctggcgUaccaccacgagcatacgc^^ 
gatggcagcatcgtgcggaggcgcggmtcgtaggtctgatactcttgaccngtcaccgcactataaga 

IS gctcatatggtggttacaatattttatcaccaggg(xgaggcacacttgcaagtgtggatccccrc 
gccgcgatgccgtcatcctcctcacgtgcgcgatccacccagagctaatctttaccatcaccaaaatc^^ 
ggtccactcatggtgctccaggctggtataacxaaagtgccgtacttcgtgcgcgcacacgggctcatte^^ 
ggtgcggaaggttgctgggggtcattatgtccaaatggctctcatgaagttggccgcactgacaggtacgtacgtttatgacc 
atctcaccccactgcgggactgggcccacgcgggcctacgagaccttgcggtggcagttgagcccgtcgtcttctctgatat 

20 ggagaccaaggttatcacctggggggcagacaccgcggcgtgtggggacatcatcttgggcctgcccgtctccgcccgca 
gggggagggagalacatctgggaccggcagacagccttgaagggcaggggtggcgactcctcgcgcctattacggccta 
ctcccaacagacgcgaggcctacttggctgcatcatcactagcctcacaggccgggacaggaaccaggtcgagggggag 
gtccaagtggtctccaccgcaacacaatctttcctggcgacctgcgtcaatggcgtgtgttggactgtctatcatggtgccgg 
ctcaaagacccttgccggcccaaagggcccaatcacccaaatgUicaccaatgtggaccaggacctcgtc^^ 

25 cgccccccggggcgcgctccttgacaccatgcacctgcggcagctcggacctttacttggtc^^ 
tccggtgcgccggcggggcgacagcagggggagcctactctcccccaggcccgtctcctacU 
ggtccactgctctgcccctcggggcacgctgtgggcatcUtegggctgccgtgtgcacccgaggggtt^^ 
gacutgtacccgtcgagtctatggaaaccacutgcggtccccggtcttcacggacaactcgtc^ 
agacattccaggtggcccatctacacgcccctactggtagcggcaagagcactaaggtgccggctgcgtatgc^^ 

30 gggtataaggtgcUgtcctgaacccgtccgtcgccgccaccctaggtUcggggcgtatatgtc^ 

ccctaacatcagaaccggggtaaggaccatcaccacgggtgcccccatcacgtactccacctatggcaagtttcttgccgac 
ggtggttgctctgggggcgcciatgacatcataatatgtgatgagtgccactcaactgactcgaccactatcctgggcatcgg 
cacagtcctggaccaagcggagacggctggagcgcgactcgtcgtgctcgccaccgctacgcctccgggatcggicacc 
gtgccacatccaaacatcgaggaggtggctctgtccagcactggagaaatccccttttatggcaaagccatccccatcgaga 

35 ccatcaagggggggaggcacctcattttctgccattccaagaagaaatgtgatgagctcgccgcgaagclglccggcctcg 
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gactcaatgctgtagcatattaccggggccttgatgtatccgtcataccaactagcggagacgt^ 
gctctaatgacgggctttaccggqgamcgactcagtgatcgactgcaatacatgtgtcacc^ 
ggacccgacctlcaccattgagacgacgaccgtgccacaagacgcggtgtcacgctcgcagcggcgagg^ 
aggggcaggatgggcamacaggmgtgactccaggagaacggccctcgggcatgttcgaltcctcggttclgtgcg 
5 gctatgacgcgggctgtgcttggtacgagctcacgcccgccgagacctcagttaggttgcgggctta^^ 
ggttgcccgtctgccaggaccalctggagttctgggagagcgtctttacaggcctcacxcacatagac^ 
cagactaagcaggcaggagacaacttcccctacctggtagcataccaggctacggtgtgcgccagggctca^^ 
tccatcgtgggaccaaatgtggaagtgtctcatacggctaaagcctacgctgcacgggccaacgcccctgct^^^ 
ggagccgttcaaaacgaggttactaccacacaccccataaccaaatacatcatggcatgcatgtcggct^ 

10 gtcacgagcacctgggtgctggtaggcggagtcctagcagctctggccgcgtattgcctgacaaca 
glgggcaggatcatcngtccggaaagccggccatcattcccgacagggaagtcctttaw 
gagtgcgcctcacacctcccttacatcgaacagggaatgcagctcgccgaiK:aattcaaacagaaggcaatcggg^ 
caaacagccaccaagcaagcggaggctgctgctcccgtggtggaatccaagtggcggaccctcgaagc^^^ 
agcatatgtggaatttcatcagcgggatacaatatnagcaggcttgtccactctgcctggcaaccccgc^ 

IS tggcattcacagcctctatcaccagcccgctcaccacccaacataccctcctgtttaacatcctgggg^ 
ccaacttgctcctcccagcgctgcttclgcmcgtaggcgccggcatcgctggagcggctgttgg^ 
aaggtgcttgtggatamtggcaggttatggagcaggggtggcaggcgcgctcgtggcctttaaggtcalgagcg^^ 
atgccctccaccgaggacctggttaacctactccctgctatcctctcccctggcgccctagtcgtcggggtcgtgtgcgcagc 
gataclgcgtcggcacgtgggcccaggggagggggctgtgcagtggatgaaccggctgatagcgllcgctlcgcggggta 

20 accacgtctcccccacgcactatgtgcctgagagcgacgctgcagcacgtgtcactcagatcctctctagtcttaccatcacl 
cagctgctgaagaggcttcaccagtggatcaacgaggactgctccacgccatgctccggctcgtggctaagagatgtttgg 
gattggatatgcacggtgltgactgatltcaagacctggctccagtccaagctcctgccgcgattgccgggaglcccctlcltc 
tcatgtcaacgtgggtacaagggagtctggcggggcgacggcatcatgcaaaccacxtgcccatgtggagcacagato^^ 
cggacatgtgaaaaacggltccatgaggatcgtggggcctaggacctgtagtaacacgtggcatggaacattccccatto^ 

25 gcgtacaccacgggcccctgcacgccctccccggcgccaaattattctagggcgctgtggcgg^ * 
cg^ggaggttacgcgggtgggggamccactacgtgacgggcatgaccactgacaacgtaaagtgcccgt^^ 
ggcccccgaattcttcacagaagtggatggggtgcggttgcacagguicgctccagcgtgcaaacccc^^ 
aggtcacattcciggtcgggctcaatcaatacctggttgggtcacagctcccatgcgagccc^^ 
cacttccatgctcaccgacccctcccacattacggcggagacggctaagcgtaggctggccaggggatct^ 

30 ggccagctcatcagctagccagctgtctgcgccttccttgaaggcaacatgcactacccgtcalgactccccggacgctgac 
ctcatcgaggccaacctcctgtggcggcaggagatgggcgggaacatcacccgcgtggagtcagaaaaiaaggtagtaat 
tttggactctttcgagccgctccaagcggaggaggatgagagggaagtatccgttccggcggagalcclgcggaggtcca 
ggaaattccctcgagcgatgcccatatgggcacgcccggattacaaccctccactgttagagtcctggaaggacccggacl 
acgtccclccagtggtacacgggtgtccatlgccgcctgccaaggcccctccgataccacctccacggaggaagaggacg 

35 gttgtcctgtcagaatctaccgtgtcltctgccUggcggagctcgccacaaagaccttcggcagclccgaatcgtcggccgt 
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cgacagcggcacggcaacggcctctcctgaccagccctccgacgacggcgacgcgggatccgacgttga^^ 
tccatgcccccccttgagggggagccgggggatcccgatctcagcgacgggtcttggte 
tgaggacgtcgtctgctgctcgatgtcctacacatggacaggcgccctgatcacgcxatgcgctg^ 
gcccatcaatgcactgagcaactcUtgctccgtcaccacaacttggtctatgctacaacatctcgcagcgcaagcctgcggc 
S agaagaaggtcacctttgacagactgcaggtcctggacgaccactaccgggacgtgctcaaggagatgaaggcgaaggc 
gtccacagttaaggctaaacttctatccgtggaggaagcctgtaagctgacgcccccacattcggcc^^ 
atggggcaaaggacgtccggaacctatcxagcaaggccgttaaccacatccgctccgtgtggaaggacttgctggaag 
actgagacaccaattgacaccaccatcatggcaaaaaatgaggttnctgcgtccaaccagagaaggggggccgcaagcc 
agctcgccttatcgtattcccagamgggggttcgtgtgtgcgagaaaatggccctttacgatgtggtct^^ 

10 gccgtgatgggctcncatacggattccaatactctcctggacagcgggtcgagttcctggtg^^ 
aatgccctatgggcttcgcatatgacacccgctgtmgactcaacggtcactgagaa^^ 
accaatgttgtgacnggccc(Xgaagccagacaggccataaggtcgctcacagagcggctttacatcggggg(^ 
ctaattctaaagggcagaactgcggctatcg(xggtgccgcgcgagcggtgtactgacgaccagctgcggta 
catgttacttgaaggccgctgcggcctgtcgagctgcgaagctccaggactgcacgatgctcgtatgc^ 

1 S cgttatctgtgaaagcgcggggacccaagaggacgaggcgagcctacgggccttcacggaggctat^^^ 

ccccccctggggacccgcccaaaccagaatacgacttggagttgataacatcatgctc^ . 
atgcatctggcaaaagggtgtactatctcacccgtgaccccaccaccccccttgcgcgggctgcgtgggagacagctagac 
acactccagtcaattcctggctaggcaacatcatcatgtatgcgcccaccttgtgggcaaggatgatcctgatgactcatttctt 
ctccatccttctagctcaggaacaacttgaaaaagccctagattgtcagatctacggggcctgttactccattgagccacttga 

20 cctacctcagatcattcaacgactccatggccttagcgcattttcactccatagttactctccaggtgagatcaatagggtggct 
tcatgcctcaggaaacttggggtaccgcccttgcgagtctggagacatcgggccagaagtgtccgcgctaggctactgtcc 
c^ggggggg^gggc^S^c^^^S^SS^^^S^^^c^^^^caactgggcagtaaggaccaagctcaaactcactcca^ 
gctgcgtcccagttggatttatccagctggttcgngctggnacagcgggggagacatatatcacagcctgtctcgtgcc 
ccccgctggttcatgtggtgcctacux:tactttctgtaggggtaggcatctatctactcccc^^ 

25 acactccaggccaauggccatcctgttttmcccnttmtmcmmttttmtt^ 

ttcctmctttecUtggtggctccatcttagccctagtcacggctagctgtgaaaggtccgtgag 
gctgatactggcctctctgcagatcaagt 

Other embodiments are within the following claims. While several 
30 embodiments have been shown and described, various modifications may be made 
without departing from the spirit and scope of the present invention. 
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WHAT IS CLAIMED B: 

1 . A nucleic acid molecule comprising a region selected from the 
group consisting of: 

5 a) an altered HCV NS3 encoding region coding for one or more 

NS3 mutations, wherein at least one of said NS3 mutations, identified by reference to 
the amino acid sequence numbering of SEQ. ID. NO. 1, is selected from the group 
consisting of: 

amino acid 109S being Ala, 
10 amino acid 1202 being Gly, and 
amino acid 1347 being Thr; 

b) an altered HCV NSS A encoding region coding for one or more 
NS5A mutations, wherein at least one of said NSSA mutations, identified by reference 
to the amino acid sequence numbering of SEQ. ID. NO. 1, is selected from the group 

15 consisting of: 

amino acid 2041 being Thr, 

a Lys insertion between residue 2039 and 2040. 

amino acid 2173 being Phe, 

amino acid 2197 being Phe, 
20 amino acid 2198 being Ser, 

amino acid 2 199 being Thr, and 

amino acid 2204 being Arg; and 

c) an altered encephalomyocarditis vims (EMCV) internal 
ribosome entry site (IRES) region containing one or more EMCV IRES mutations, 

25 wherein at least one of said EMCV IRES mutations, identified by reference to the 
nucleotide number of SEQ. ID. NO. 3, is an insertion at nucleotide 1736 of adenine. 

2. The nucleic acid molecule of claim 1, wherein said nucleic acid 
molecule comprises said NSSA encoding region. 



30 



3. The nucleic acid molecule of claini 2, wherein at least two of 
said NSSA adaptive mutations are present. 
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4. The nucleic acid molecule of claim 2, further comprising a 
region encoding for a HCV NS3 region, wherein said NS3 region may be the same or 
different than said altered NS3 region. 

5 5. The nucleic acid molecule of claim 4, wherein said nucleic acid 

molecule is an HCV replicon comprising a HCV 5* UTR-PC region, said NS3 
encoding region, an HCV NS4A encoding region, an HCV NS4B encoding region, 
said NSSA encoding region, an HCV NSSB encoding region, and a HCV 3* UTR; 

10 6. The nucleic acid molecule of claim S, wherein said HCV 

replicon further comprises a sequence encoding for a reporter protein. 

7. The nucleic acid molecule of claim 5, wherein isaid HCV 
replicon further comprises a sequence encoding for a selection protein. 

15 

8. The nucleic acid molecule of claim 5, wherein said HCV 
replicon further comprises a HCV core encoding region, a HCV El encoding region, a 
HCV E2 encoding region, a HCV p7 encoding region, and a HCV NS2 encoding 
region. 

20 

9. A nucleic acid molecule comprising a region selected from the 
group consisting of: 

a) an altered HCV NS3 encoding region containing one or more 
NS3 mutations, wherein at least one of said NS3 mutations, identified by reference to 

25 the nucleotide numbering of SEQ. ID. NO. 2, is selected from the group consisting of: 
nucleotide 3625 being cytosine, 
nucleotide 3946 being guanine, 
nucleotide 4380 being adenine, 

b) an altered HCV NSSA encoding region containing one or more 
30 NSSA mutations, wherein at least one of said NSSA mutations, identified by reference 

to the nucleotide numbering of SEQ. ID. NO. 2, is selected from the group consisting 
of: 

an insertion of 3 adenine residues between nucleotide 6458 and 6459, 
nucleotide 6463 being cytosine, 
35 nucleotide 6859 being thymine or uracil, 
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nucleotide 6931 being thymine or uracil, 
nucleotide 6934 being cytosine, 
nucleotide 6936 being adenine, and 
nucleotide 69S3 being adenine or guanine; and 
5 c) an altered encephalomyocarditis vims (EMC V) internal 

ribosome entry site (IRES) region containing one or more EMC V IRES mutations, 
wherein at least one of said EMCV IRES mutations, identified by reference to the 
nucleotide number of SEQ. ID. NO. 3, is an insertion at nucleotide 1736 of adenine. 

10 10. The nucleic acid molecule of claim 9, wherein said molecule 

comprises said altered NS5A encoding region, and the nucleotide sequence of said 
altered NS5 A region is provided for by bases 6258-7598 of SEQ. ID. NO. 2, or the 
RNA version thereof, modified with one or more of said NS5A modifications selected 
from the group consisting of: 

15 an inseition of 3 adenine residues between nucleotide 6458 and 6459, 
nucleotide 6463 being cytosine, 
nucleotide 6859 being thymine or uracil, 
nucleotide 6931 being thymine or uracil, 
nucleotide 6934 being cytosine, 

20 nucleotide 6936 being adenine, and 

nucleotide 6953 being adenine or guanine. 

1 1. The nucleic acid molecule of claim 10, wherein said molecule 
is an HCV replicon comprising a HCV 5* UTR-PC region, a modified HCV NS3- 

25 NS5B region, and a HCV 3' UTR, wherein said modified NS3-NS5B region 
comprises said altered NS5A region. 

12. The nucleic acid molecule of claim 11, wherein said 5' UTR- 
PC region is the RNA version of bases 1-377 of SEQ. ID. NO. 2 and said 3' UTR is 

30 the RNA version of bases 9374-9605 of SEQ. ID. NO. 2. 

13. The nucleic acid molecule of claim 10, wherein said molecule 
is an HCV replicon comprising a HCV 5' UTR-PC region, a modified HCV NS3- 
NS5B region, and a HCV 3' UTR, wherein 

35 said 5' UTR-PC region is the RNA version of bases 1-377 of SEQ. ID. NO. 2; 
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said 3' UTR is the RNA version of bases 9374-9605 of SEQ. ID. NO. 2; and 

said modified NS3-NS5B region consists of the RNA version of bases 3420-9371 of 

SEQ. ID. NO. 2 modified with one or more modifications selected from the group 

■> 

consisting of: 
5 nucleotide 4380 being adenine, 
nucleotide 3625 being cytosine, 
nucleotide 3946 being guanine, 

an inseition of 3 adenine residues between nucleotide 6458 and nucleotide 6459, 
nucleotide 6463 being cytosine, 
10 nucleotide 6859 being uracil, 
nucleotide 6931 being uracil, 
nucleotide 6934 being cytosine, 
nucleotide 6936 being adenine, and 
nucleotide 6953 being adenine or guanine. 

15 

14. The nucleic acid molecule of claim 13, wherein said replicon is 
a genomic replicon that further comprises the RNA version of nucleotides 378-3419 
of SEQ. ID. NO. 2. 

15. A nucleic acid molecule comprising the nucleic acid base 
sequence of bases 1-7989 of SEQ. ID. NO. 3, or the RNA version thereof, consisting 
of one or more different modifications selected from the group consisting of: 

a) nucleotides 5335-5337 modified to code for arginine; 

b) nucleotides 5242-5244 modified to code for phenylalanine; 

c) nucleotides 53 14-53 16 modified to code for phenylalanine; 

d) nucleotides 53 17-53 19 modified to code for serine; 

e) nucleotides coding for lysine inserted after nucleotide 4843; 
0 nucleotides 2329-2331 modified to code for glycine, nucleotides 2764-2766 
modified to code for threonine, nucleotides 5242-5244 modified to code for 
phenylalanine, and an extra adenosine inserted after nucleotide 1736; 

g) nucleotides 4846-4848 modified to code for threonine, and nucleotides 5242-5244 
modified to modified to code for phenylalanine; 

h) nucleotides 4846-4848 modified to code for threonine, and nucleotides 5314-5316 
modified to code for phenylalanine; 



20 



25 



30 
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i) nucleotides 4846-4848 modified to code for threonine, and nucleotides S3 17-53 19 
modified to code for serine; 

j) nucleotides 2329-2331 modified to code for glycine, and nucleotides coding for 
lysine inserted after nucleotides 4843; 
S k) nucleotides 5314-5316 modified\to code for phenylalanine and nucleotides 5320- 
5322 modified to code for threonine; 

1) nucleotides 4846-4848 modified to code for threonine, nucleotides 53 14-53 16 
modified to code for phenylalanine,. and nucleotides 5320-5322 modified to code for 
threonine; 

10 m) nucleotides 4846-4848 modified to code for threonine, nucleotides 5314-5316 
modified to code for phenylalanine, and an extra adenosine inserted after nucleotide 
1736; and 

n) nucleotides 5314-5316 modified to code for phenylalanine, nucleotides 5320-5322 
modified to code for threonine, and an extra adenosine inserted after nucleotide 1736; 
15 and 

0) nucleotides 5320-5322 modified to code for threonine. 

16. The nucleic acid of claim 15, wherein said one or more 
different modifications is selected from the group. consisting of: 
20 a) C5337A; 

b) C5243TorU; 

c) C5315TorU; 

d) TorU5318C; 

e) AAA inserted after 4843; 

25 f) A2330G, G2764A, C5243T or U. and adenosine inserted 1736; 

g) A4847C and C5243T or U; 

h) A4847C and C53 15T or U; 

1) A4847CandTorU5318C; 

j) A2330G and AAA inserted after 4843; 
30 k) C5315T or U and G5320A; 

I) A4847C,C5315TorU,andG5320A; 
m) A4847C, C53 15T or U, and adenosine inserted 1736; 
n) C53 1 5T or U, G5320A and adenosine inserted 1736; and 
o) G5320A. 
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17. The nucleic acid of claim 16, wherein said nucleic acid is RNA 
and comprises said nucleic acid base sequence. 

5 18. The nucleic acid of claim 17, wherein said nucleic acid is RNA 

and consists of said nucleic acid base sequence. 

19. An expression vector comprising a nucleotide sequence coding 
for the nucleic acid molecule of any one of claims 1-18, wherein said nucleotide 

10 sequence is transcriptionally coupled to an exogenous promoter. 

20. A recombinant cell human hepatomia cell, wherein said cell 
comprises the nucleic acid of any one of claims 5-8 and 11-18. 

IS 21 . The recombinant cell of claim 20, wherein said hepatoma cell 

is an Huh-7 cell. 

22. The recombinant cell of claim 20, wherein said cell is derived 
from a Huh-7 cell. 

20 

23. A recombinant cell made by a process comprising the step of 
introducing into a human hepatoma cell the nucleic acid of any one of claims S-8 and 
11-18. 

25 24. A method of making an HCV replicon enhanced cell 

comprising the steps of: 

a) introducing and maintaining a HCV replicon in a cell; and 

b) curing said cell of said HCV replicon to produce said replicon 

enhanced cell. 

30 

25. The method of claim 24, wherein said cell is a human 

hepatoma cell. 

26. The method of claim 24, wherein said cell is a Huh-7 cell or is 
35 derived from a Huh-7 cell. 



45 



wo 02/059321 



PCT/£P02/06S26 



27. The method of claim 26, further comprising the step of 
confirming the ability of said replicon enhanced cell to maintain an HCV replicon. 

5 28 A method of making an HCV replicon enhanced cell containing 

a functional HCV replicon comprising the steps of: 

a) introducing and maintaining a first HCV replicon in a cell; 

b) curing said cell of said first replicon to produce a cured cell; 

and 

10 c) introducing and nuuntaining a second HCV replicon into said 

cured cell, wherein said second HCV replicon may be the same or different than said 
first HCV replicon. 

29 The method of claim 28, wherein said cell is a human 

15 hepatoma ceil. 

30. The method of claim 29, wherein said human hepatoma cell is 

a Huh-7 cell. 

20 31. The method of claim 30, wherein said human hepatoma cell is 

derived from a Huh-7 cell. 

32. An HCV replicon enhanced cell made by the method of any 
one of claims 24-27, 

25 

33. An HCV replicon enhanced cell containing a HCV replicon 
made by the method of any one of claims 28-31. 

34. A method of measuring the ability of a compound to affect 
30 HCV activity comprising the steps of: 

a) providing said compound to the HCV replicon enhanced cell of 

claim 33; and 

b) measuring the ability of said compound to effect one or more 
replicon activities as a measure of the effect on HCV activity. 

35 
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35. The method of claim 34, wherein said compound is a ribozyme. 

36. The method of claim 34, wherein said compound in an 
antisense nucleic acid. 

5 

37. The method of claim 34, wherein compound is an organic 

compound. 

38. The method of claim 34, wherein said step (b) measures HCV 
10 protein production. 

39. The method of claim 33, wherein said step (b) measures 
production of RNA transcripts. 
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1 GCCAGCCCCC GATTGGGGGC GACACTCCAC CATAGATCAC TCCCCTGTGA 

51 GGAACTACTG TCTTCACGCA GAAAGCGTCT A6CCATG6CG TTAGTATC3AG 

101 TGTCGTGCAG CCTCCAGGAC CCCCCCTCCC GGGAGAGCCA TAGTGGTCTG 

151 C6GAACCG6T 6A6TACACCG GAATTGCCAG GACGACCGGG TCCTTTCTTG 

. 201 GATCAACCCG CTCAATGCCT GGAGATTTGG GCGTGCCCCC GCGAGACTGC 

251 TAGCCGAGTA GTGTTGGGTC GCGAAAGGCC TTGTGGTACT GCCTGATAGG 

301 GTGCTTGCGA GTGCCCCGGG AGGTCTCGTA GACCGTGCAC CATGAGCACG 

351 AATCCTAAAC CTCAAAGAAA AACCAAAGGG CGCGCCATGA TTGAACAAGA 

401 TGGATTGCAC GCAGGTTCTC CGGCCGCTTG GGTGGAGAGG CTATTCGGCT 

451 ATGACTGGGC ACAACAGACA ATCGGCTGCT CTGATGCCGC CGTGTTCCGG 

501 CTGTCAGCGC AGGGGCGCCC GGTTCTTTTT GTCAAGACCG ACCTGTCCGG 

551 TGCCCTGAAT GAACTGCAGG ACGAGGCAGC GCGGCTATCG TGGCTGGCCA 

601 CGACGGGCGT TCCTTGCGCA GCT6TGCTCG ACGTTGTCAC TGAAGCGGGA 

651 AGGGACTGGC TGCTATTGGG CGAAGTGCCG GGGCAGGATC TCCTGTCATC 

701 TCACCTTGCT CCTGCC6AGA AAGTATCCAT CATGGCTGAT GCAATGCGGC 

751 GGCTGCATAC GCTTGATCCG GCTACCTGCC CATTCGACCA CCAAGCGAAA 

801 CATCGCATCG AGCGAGCACG TACTCGGATG GAAGCCGGTC TTGTCGATCA 

851 GGATGATCTG GACGAAGAGC ATCAGGGGCT CGCGCCAGCC GAACTGTTCG 

901 CCAGGCTCAA GGCGCGCATG CCCGACGGCG AGGATCTCGT CGTGACCCAT 

951 GGCGATGCCT GCTTGCCGAA TATCATGGTG GAAAATGGCC GCTTTTCTGG 

1001 ATTCATCGAC TGTGGCCGGC TGGGTGTGGC GGACCGCTAT CAGGACATAG 

1051 CGTTGGCTAC CCGTGATATT GCTGAAGAGC TTGGCGGCGA ATGGGCTGAC 

1101 CGCTTCCTCG TGCTTTACGG TATCGCCGCT CCCGATTCGC AGCGCATCGC 

1151 CTTCTATCGC CTTCTT6ACG AGTTCTTCTG AGTTTAAACA GACCACAACG 

1201 GTTTCCCTCT AGCGGGATCA ATTCCGCCCC TCTCCCTCCC CCCCCCCTAA 

1251 CGTTACTGGC CGAAGCXTGCT TGGAATAAGG CCGGTGTGCG TTTGTCTATA 

1301 TGTTATTTTC CACCATATTG CCGTCTTTTG GCAATGTGAG GGCCCGGAAA 

1351 CCTGGCCCTG TCTTCTTGAC GAGCATTCCT AGGGGTCTTT CCCCTCTCGC 

1401 CAAAGGAATG CAAGGTCTGT TGAATGTCGT GAAGGAAGCA GTTCCTCTGG 

1451 AAGCTTCTTG AAGACAAACA ACGTCTGTAG CGACCCTTTG CAGGCAGCGG 

1501 J\ACCCCCCAC CTGGCGACAG GTGCCTCTGC GGCCAAAAGC CACGTGTATA 

FIG. lA 



1/7 



wo 02/059321 PCT/EP02/00526 

1551 AGATACACCT GCAAAGGCGG CACAACCCCA GTGCCACGTT GTGAGTTGGA 

1601 TAGTTGTGGA AAGAGTCAAA TGGCTCTCCT CAAGCGTATT CAACAAGGGG 

1651 CTGAAG6ATG CCCAGAAGGT ACCCCATTGT ATGGGATCTG ATCTGGGGCC 

1701 TCG6TGCACA TGCTTTACAT GTGTTTAGTC 6AGGTTAAAA AACGTCTAGG 

1751 CCCCCCGAAC CACGGGGAC6 TGGTTTTCCT TTGAAAAACA CGATAATACC 

1801 ATGGCGCCTA TTACGGCCTA CTCCCAACAG ACGCGAGQCC TACTTGGCTG 

1851 CATCATCACT AGCCTCACAG GCCGGGACAG GAACCAGGTC GAGGGGGAGG 

1901 TCCAAGTGGT CTCCACCGCA ACACAATCTT TCCTGGCGAC CTGCGTCAAT 

1951 GGCGTGTGTT GGACTGTCTA TCATGGTGCC GGCTCAAAGA CCCTTGCCGG 

2001 CCCAAAGGGC CCAATCACCC AAATGTACAC CAATGTGGAC CAGGACCTCG 

2051 TCGGCTGGCA AGCGCCCCCC GGGGCGCGTT CCTTGACACC ATGCACCTGC 

2101 GGCAGCTCGG ACCTTTACTT GGTCACGAGG CATGCCGATG TCATTCCGGT 

2151 GCGCCGGCGG GGCGACAGCA GGGGGAGCCT ACTCTCCCCC AGGCCCGTCT 

2201 CCTACTTGTU^ GGGCTCTTCG GGCGGTCCAC TGCTCTGCCC CTCGGGGCAC 

2251 GCTGTGGGCA TCTTTCGGGC TGCCGTGTGC ACCCGAGGGG TTGCGAAGGC 

2301 GGTGGACTTT GTACCCGTCG AGTCTATGGA AACCACTATG CGGTCCCCGG 

2351 TCTTCACGGA CAACTCGTCC CCTCCGGCCG TACCGCAGAC ATTCCAGGTG 

2401 GCCCATCTAC ACGCCCCTAC TGGTAGCGGC AAGAGCACTA AGGTGCCGGC 

2451 TGCGTATGCA GCCCAAGGGT ATAAGGTGCT TGTCCTGAAC CCGTCCGTCG 

2501 CCGCCACCCT AGGTTTCGGG GCGTATATGT CTAAGGCACA TGGTATCGAC 

2551 CCTAACATCA GAACCGGGGT AAGGACCATC ACCACGGGTG CCCCCATCAC 

2601 GTACTCCACC TATGGCAAGT TTCTTGCGGA CGGTGGTTGC TCTGGGGGCG 

2651 CCTATGACAT CATAATATGT GATGAGTGCC ACTCAACTGA CTCGACCACT 

2701 ATCCTGGGCA TCGGCACAGT CCTGGACCAA GCGGAGACGG CTGGAGCGCG 

2751 ACTCGTCGTG CTCGCCACCG CTACGCCTCC GGGATCGGTC ACCGTGCCAC 

2801 ATCCAAACAT CGAGGAGGTG GCTCTGTCCA GCACTGGAGA AATCCCCTTT 

2851 TATGGCAAAG CCATCCCCAT CGAGACCATC AAGGGGGGGA GGCACCTCAT 

2901 TTTCTGCCAT TCCAAGAAGA AATGTGATGA GCTCGCCGCG AAGCTGTCCG 

2951 GCCTCGGACT CAATGCTGTA GCATATTACC GGGGCCTTGA TGTATCCGTC 

3001 ATACCAACTA GCGGAGACGT CATTGTCGTA GCAACGGACG CTCTAATGAC 

3051 GGGCTTTACC GGCGATTTCG ACTCAGTGAT CGACTGCAAT ACATGTGTCA 
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JlUl 


CCCAGACAGT CGACTTCAGC CTGGACCCGA CCTTCACCAT TGAGACGAC6 




ACCGTGCCAC AAGACGCGGT GTCACGCTCG CAGCGGCGAG GCAGGACTGG 


TOO"! 


TAGGGGCAG6 ATGGGCATTT ACAGGTTT6T 


GACTCCAGGA 


GAACGGCCCT 




CGGGCATGTT CGATTCCTCG GTTCTGTGCG 


AGTGCTATGA CGCGGGCTGT 




GCTTGGTACG AGCTCACGCC CGCC6AGACC 


TCAGTTAGGT 


TGCGGGCTTA 


"SO CI 

JJ!>1 


CCTAAACACA CCAGGGTTGC CCGTCTGCCA 


GGACCATCTG 


GAGTTCTGGG 


J401 


A6AGCGTCTT TACAGGCCTC ACCCACATAG 


ACGCCCATTT 


CTTGTCCCAG 


34!>1 


ACTAAGCAGG CAGGAGACAA CTTCCCCTAC 


CTGGTAGCAT 


ACCAGGCTAC 




GGTGTGCGCC AGGGCTCAGG CTCCACCTCC 


ATCGTGGGAC 


CAAATGTGGA 


3551 


AGTGTCTCAT ACGGCTAAAG CCTACGCTGC 


ACGGGCCAAC 


GCCCCTGCTG 


3601 


TATAGGCTGG GAGCCGTTCA AAACGAGGTT 


ACTACCACAC 


ACCCCATAAC 


3651 


CAAATACATC ATGGCATGCA TGTCGGCTGA CCTGGAGGTC 


6TCACGAGCA 


3701 


ccmacymcT ocsTACcnccA gtcctagcag 


CTCTGGCCGC 


GTATTGCCTG 


3751 


APAACAfSGPA OCGTGCyrCAT TGTG/GlGlC*AGG 


ATCATCTTGT 


CCGGAAAGCC 


3801 


OfiPPATPATT PCPfiAPACJRG AAf2TCPTTTA 


CCGGGAGTTC 


GATGAGATGG 


3851 


AAGAGTGCGC CTCACACCTC CCTTACATCG 


AACAGGGAAT 


GCAGCTCGCC 


3901 


GAACAATTCA AACAGAAGGC AATCGG6TTG 


CTGCAAACAG 


CCACCAAGCA 


3951 


AGCGGAGGCT GCTGCTCCCG TGGTGGAATC 


CAAGTGGCGG 


ACCCTCGAAG 


4001 


CCTTCTGGGC GAAGCATATG TGGAATTTCA 


TCAGCGGGAT 


ACAATATTTA 


4051 


GCAGGCTTGT CCACTCTGCC TGGCAACCCC 


GCGATAGCAT 


CACTGATGGC 


4101 


ATTCACAGCC TCTATCACCA GCCCGCTCAC 


CACCCAACAT 


ACCCTCCTGT 


4151 


TTAACATCCT GGGGGGATGG GTGGCCGCCC 


AACTTGCTCC 


TCCCAGCGCT 


4201 


GCTTCTGCTT TCGTAGGCGC CGGCATCGCT 


GGAGCGGCTG 


TTGGCAGCAT 


4251 


AGGCCTTGGG AAGGTGCTTG TGGATATTTT 


GGCAGGTTAT GGAGCAGGGG 


4301 


TGGCAGGCGC GCTC6TGGCC TTTT^GGTCA 


TGAGCGGCGA 


GATGCCCTCC 


Jl 1 C 1 

4351 


ACCGAGGACC TGGTTAACCT ACTCCCTGCT 


ATCCTCTCCC 


CTGGCGCCCT 


44U1 


AGTCGTCGGG GTCGTGTGCG CAGC6ATACT 


GCGTCGGCAC 


GTGGGCCCAG 


4451 


GGGAGG6GGC TGTGCAGTGG ATGAACCGGC 


TGATAGCGTT 


CGCTTCGCGG 


4501 


GGTAACCAC6 TCTCCCCCAC GCACTATGTG 


CCTGAGAGCG 


ACGCTGCAGC 


4551 


ACGTGTCACT CAGATCCTCT CTAGTCTTAC 


CATCACTCAG 


CTGCTGAAGA 


4601 


GGCTTCACCA GTGGATCAAC GAGGACTGCT 


CCACGCCATG 


CTCCGGCTCG 
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4651 


TGGCTT^GAG 


ATGTTTGGGA TTGGATATGC ACGGTGTTGA CTGATTTCAA 


4701 


GACCTGGCTC 


CAGTCCAAGC TCCTGCCGCG ATTGCCGGGA 


GTCCCCTTCT 


4751 


TCTCATGTCA 


ACGTGGGTAC AAGGGA6TCT GGCGGGGCGA CG6CATCATG 


4801 


CAAACCACCT 


GCCCATGTGG A6CACAGATC ACCGGACATG TGAAAAACGG 


4851 


TTCCATGAGG 


ATCGTGGGGC CTAGGACCTG TAGTAACACG 


TGGCATGGAA 


4901 


CATTCCCCAT 


TAACGCGTAC ACCACGGGCC CCTGCACGCC 


CTCCCCGGCG 


4951 


CCAT^TTATT 


CTAGGGCGCT GTGGCGGGTG GCTGCTGAGG 


AGTACGTGGA 


5001 


GGTTACGCGG 


GTGGGGGATT TCCACTACGT GACGGGCATG 


ACCACTGACA 


5051 


ACGTAAAGTG 


CCCGTGTCAG GTTCCGGCCC CCGAATTCTT 


CACAGAAGTG 


5101 


GATGGGGTGC 


GGTTGCACAG GTACGCTCCA GCGTGCAAAC 


CCCTCCTACG 


5151 


GGAGGAGGTC 


ACATTCCTG6 TCGGGCTCAA TCAATACCTG GTTGGGTCAC 


5201 


AGCTCCCATG 


CGA6CCCGAA CCGGACGTAG CAGTGCTCAC 


TTCCATGCTC 


5251 


ACCGACCCCT 


CCCACATTAC GGCGGAGACG GCTAAGCGTA. GGCTGGCCAG 


5301 


GGGATCTCCC 


CCCTCCTTGG CCAGCTCATC AGCTAGCCAG 


CTGTCTGCGC 


5351 


CTTCCTTGAA 


GGCAACATGC ACTACCCGTC ATGACTCCCC 


GGACGCTGAC 


5401 


CTCATCGAGG 


CCAACCTCCT GTGGCGGCAG GAGATGGGCG 


GGAACATCAC 


5451 


CCGCGTGGAG 


TCAGAAAATA AGGTAGTAAT TTTGGACTCT 


TTCGAGCCGC 


5501 


TCCAAGCGGA 


GGAGGATGAG AGGGAAGTAT CCGTTCCGGC 


GGAGATCCTG 


5551 


CGGAGGTCCA 


GGAAATTCCC TCGAGCGATG CCCATATGGG 


CACGCCCGGA 


5601 


TTACAACGCT 


CCACTGTTAG AGTCCTGGAA GGACCCGGAC 


TACGTCCCTC 


5651 


CAGTGGTACA 


CGGGTGTCCA TTGCCGCCTG CCAAGGCCCC 


TCCGATACCA 


5701 


CCTCCACGGA 


GGAAGAGGAC GGTTGTCCTG TCAGAATCTA 


CCGTGTCTTC 


5751 


TGCCTTGGCG 


GAGCTCGCCA CAAAGACCTT CGGCAGCTCC 


GAATCGTCGG 


5801 


CCGTCGACAG 


CGGCACGGCA ACGGCCTCTC CTGACCAGCC 


CTCCGACGAC 


5851 


GGCGACGCG6 


GATCCGACGT TGAGTCGTAC TCCTCCATGC 


CCCCCXTTTGA 


5901 


GGGGGAGCCG 


GGGGATCCC6 ATCTCAGCGA CGGGTCTTGG 


TCTACCGTAA 


5951 


GCGAGGAGGC 


TAGTGAGGAC GTCGTCTGCT GCTCGATGTC 


CTACACATG6 


6001 


ACAGGCGCCC 


TGATCACGCC ATGCGCTGCG GAGGAAACCA AGCTGCCCAT 


6051 


CAATGCACTG 


AGCAACTCTT TGCTCCGTCA CCACAACTTG GTCTATGCTA 


6101 


CAACATCTCG 


CAGCGCAAGC CTGCGGCAGA AGAAGGTCAC 


CTTTGACAGA 


6151 


CTGCAGGTCC 


TGGACGACCA CTACCGGGAC GTGCTCAAGG 


AGATGAAGGC 
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6201 


GAAGGC6TCC ACAGTTAAOG CTAAACTTCT ATCCGTGGAG GAAGCCTGTA 


6251 


AGCT6ACGCC CCCACATTCG GCCAGATCTA AATTTGGCTA TGGGGCAAAG 


6301 


GACGTCCGGA ACCTATCCAG 


CAAGGCCGTT AACCAC7VTCC 


GCTCCGTGTG 


6351 


GAAGGACTTG CTG6AAGACA CTGA6ACACC AATTGACACC ACCATCATGG 


6401 


CAAAAAAT6A GGTTTTCTGC 


GTCCAACCAG AGAAGGGGGG 


CCGCAAGCCA 


6451 


GCTCGCCTTA 


TCGTATTCCC 


AGATTTGGGG GTTCGTGTGT 


GCGA6A7UVAT 


6501 


GGCCCTTTAC 


GATGTGGTCT 


CCACCCTCCC TCAGGCCGTG 


ATGGGCTCTT 


6551 


CATACGGATT 


CCAATACTCT 


CCTGGACAGC GGGTCGAGTT 


CCTGGTGAAT 


6601 


GCCTGGAAAG 


CGAAGAAATG 


CCCTATGGGC TTCGCATATG 


ACACCCGCTG 


6651 


TTTTGACTCA ACGGTCACTG AGAATGACAT CCGTGTTGAG 


GAGTCAATCT 


6701 


ACCAATGTTG 


TGACTTGGCC 


CCCGAAGCCA GACAGGCCAT 


AAGGTCGCTC 


6751 


ACAGAGCGGC 


TTTACATCGG 


GGGCCCCCT6 ACTAATTCTA AAGGGCAGAA 


6801 


CTGCGGCTAT 


CGCCGGTGCC 


GCGCGAGCGG TGTACTGAC6 ACCAGCTGCG 


6851 


GTAATACCCT CACATGTTAC 


TTGAAGGCCG CTGCGGCCTG 


TCGAGCTGCG 


6901 


AAGCTCCAGG 


ACTGCACGAT 


GCTCGTATGC GGAGACGACC 


TTGTCGTTAT 


6951 


CTGTGAT^GC 


GCGGGGACCC 


AA6AGGACGA GGCGAGCCTA 


CGGGCCTTCA 


7001 


CGGAGGCTAT 


GACTAGATAC 


TCTGCCCCCC CTGGGGACCC 


GCCCAAACCA 


7051 


GAATACGACT 


TGGAGTTGAT 


AACATCATGC TCCTCCAATG 


TGTCAGTCGC 


7101 


GCACGATGCA 


TCTGGCAAAA 


GGGTGTACTA TCTCACCCGT 


GACCCCACCA 


7151 


CCCCCCTTGC 


GCGGGCTGCG 


TGGGAGACAG CTAGACACAC 


TCCAGTCAAT 


7201 


TCCTGGCTAG 


GCAACATCAT 


CATGTATGCG CCCACCTTGT 


GGGCAAGGAT 


7251 


GATCCTGATG 


ACTCATTTCT 


TCTCCATCCT TCTAGCTCAG 


GAACAACTTG 


7301 


AAAAAGCCCT 


AGATTGTCA6 


ATCTACGGGG CCTGTTACTC 


CATTGAGCCA 


7351 


CTTGACCTAC 


CTCAGATCAT TCAACGACTC CATGGCCTTA 


GCGCATTTTC 


7401 


ACTCCATAGT 


TACTCTCCAG 


GTGAGATCAA TAGGGTGGCT 


TCATGCCTCA 


7451 


GGAAACTTGG GGTACCGCCC TTGCGAGTCT GGAGACATCG GGCCAGAAGT 


7501 


GTCCGCGCTA GGCTACTGTC 


CCAGGGGGGG AGGGCTGCCA 


CTTGTGGCAA 


7551 


GTACCTCTTC 


AACTGGGCAG 


TAAGGACCAA GCTCAAACTC 


ACTCCAATCC 


7601 


CGGCTGCGTC 


CCAGTTGGAT 


TTATCCAGCT GGTTCGTTGC 


TGGTTACAGC 


7651 


GGGGGAGACA 


TATATCACAG 


CCTGTCTCGT GCCCGACCCC 


GCTGGTTCAT 


7701 


GTGGTGCCTA 


CTCCTACTTT 


CTGTAGGGGT AGGCATCTAT 


CTACTCCCCA 
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7751 ACCGATGAAC GGGGAGCTAA ACACTCCAGG CCAATAGGCC ATCCTGTTTT 

7801 TTTCCCTTTT TTTTTTTCTT TTTTTTTTTT TTTTTTTTTT TTT T TTTTTT 

785i TTCTCCTTTT TTTTTCCTCT TTTTTTCCTT TTCTTTCCTT TGGTGGCTCC 

7901 ATCTTAGCCC TAGTCACGGC TAGCTGTGAA AGGTCCGTGA GCCGCTTGAC 

7951 TGCAGAGAGT GCTGATACTG GCCTCTCTGC AGATCAAGTA CTTCTAGAGA 

8001 ATTCTAGCTT GGCGTAATCA TGGTCATAGC TGTTTCCTGT GTGAAATTGT 

8051 TATCAGCTCA CAATTCCACA CAACATACGA GCCGGAAGCA TAAAGTGTAA 

8101 AGCCTGGGAT GCCTAATGAG TGAGCTAACT CACATTAGTT GCGTTGCGCT 

8151 CACTGCCCGC TTTCCAGTCG GGAAACCTGT CGTGCCAGCT CCATTAGTGA 

8201 ATCGTCCAAC GCACGGGGAG AGGCGGTTTG C6TATTGGGC GCACTTCCGC 

8251 TTCCTCGCTC ACT6ACTCGC TGCGCTCGTT C6TTCGGCTG C6GCGAGCCG 

8301 TATCAGCTCA CTCAAAGGCG GTAATACGGT TATCCACAGA ATCAGGGGAT 

8351 AACGCAGGAA AGACCATGTG AGCAAAAGGC CAGCAAAAGG CCAGGAACCG 

8401 TAAAAAGGCC GCGTTGCTGG CGTTTTTCCA TAGGCTCCGC CCCCCTGACG 

8451 AGCATCACAA AAATCGACGC TCAAGTCAGA GGTGGCGAAA CCCGACAGGA 

8501 CTATAAAGAT ACCAGGCGTT TCCCCCTGGA AGCTCCCTCG TGCGCTCTCC 

8551 TGTTCCGACC CTGCCGCTTA CCGGATACCT GTCCGCCTTT CTCCCTTCGG 

8601 GT^GCGTGGC GCTTTCTCAT AGCTCACGCT GTAGGTATCT CAGTTCGGTG 

8651 TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG CACGAACCCC CCGTTCAGCC 

8701 CGACCGCTGC GCCTTATCCG GTAACTATCG TCTTGAGTCC AACCCGGTAA 

8751 GACACGACTT ATCGCCACTG GCAGCAGCCA CTGGTAACAG GATTAGCAGA 

8801 GCGAGGTATG TAGGCGGTGC TACAGAGTTC TTGAAGTGGT GGCCTAACTA 

8851 CGGCTACACT AGAAGGACAG TATTTGGTAT CTGCGCTCTG CTGAAGCCAG 

8901 TTACCTTCGG AAAAAGAGTT 6GTAGCTCTT GATCCGGCAA ACAAACCACC 

8951 GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG CAGCAGATTA CGCGCAGAAA 

9001 AAAAGGATCT CAAGAAGATC CTTTGATCTT TTCTACGGGG TCTGACGCTC 

9051 AGTGGAACGA AAACTCACGT TAAGGGATTT TGGTCATGAG ATTATCAAAA 

9101 AGGATCTTCA CCTAGATCCT TTTAAATTAA AAATGAAGTT TTAAATCAAT 

9151 CTAAAGTATA TATGAGTAAA CTTGGTCTGA CAGTTACCAA TGCTTAATCA 

9201 GTGAGGCACC TATCTCAGCG ATCTGTCTAT TTCGTTCATC CATAGTTGCC 

9251 TGACTCCCCG TCGTGTAGAT AACTACGATA CGGGAGGGCT TACCATCTGG 
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9301 CCCCAGTGCT GCAATGATAC CGCGAGAACC ACGCTCACCC GCACCAGATT 

9351 TATCAGCAAT AAACCAGCCA GCCGGAAGTG CGCTGCGGAG AAGTGGTCCT 

9401 GCAACTTTAT CCGCCTCCAT CCAGTCTATT AGTTGTTGCC GGGAAGCTAG 

9451 AGTAAGTAGT TCGCCAGTCA GCAGTTTGCG TAACGTCGTT GCCATAGCAA 

9501 CAGGCATCGT GGTGTCACGC TCGTCGTTTG GTATGGCTTC ATTCAGCTCC 

9551 GGCTCCCAAC GATCAAGGCG AGTTACATGA TCCC(XATGT TGTGCAAAAA 

9601 AGCGGTTAGC TCCTTCGGTC CTCCGATCGT TGTCAGAAGT AAGTTGGCCG 

9651 CAGTGTTATC ACTCATGGTT ATGGCAGCAC TGCATAATTC TCTTACTGTC 

9701 ATGCCATCCG TAAGATGCTT TTCTGTGACT GGTGAGTACT CAACCAAGTC 

9751 ATTCTGAGAA TAGTGTATGC GGCGACCGA6 TTGCTCTTGC CCGGCGTCAA 

9801 TACGGGATAA TACCGCGCCA CATAGCAGAA CTTTAAAAGT GCTCATCATT 

9851 GGAAAACGTT CTTCGGGGCG AAAACTCTCA AGGATCTTAC CGCTGTTGAG 

9901 ATCCAGTTCG ATGTAACCCA CTCGTGCACC CAACTGATCT TCAGCATCTT 

9951 TTACTTTCAC CAGCGTTTCT GGGTGAGCAA AAACAGGAAG GCAAAATGCC 

10001 GCAAAAAAGG GAATAAGGGC GACACGGAAA TGTTGAATAC TCATACTCTT 

10051 CCTTTTTCAA TATTATTGAA GCATTTATCA GGGTTATTGT CTCATGAGCG 

10101 GATACATATT TGAATGTATT TAGAAAAATA AACAAATAGG GGTTCCGCGC 

10151 ACATTTCCCC GAAAAGTGCC ACCTGACGTC TAAGAAACCA TTATTACCAT 

10201 GACATTAACC TATATU^TA GGCGTATCAC GAAGCCCTTT CGTCTAGCGC 

10251 GTTTCGGTGA TGACGGTGAA AACCTCTGAC ACTTGCAGCT CCCGCAGACG 

10301 GTCACAGCTT GTCTGTAAGC GGATGCCGGG AGCAGGCAAG CCCGTCAGGG 

10351 CGCGTCAGTG GGTGTTGGCG GGTGTCGGGG CTGGCTTAAC TATGCGGCAT 

10401 CAGAGCAGAT TGTACTGAGA GTACACCAGA TGCGGTGTGA AATACCGCAC 

10451 AGATGCGTAA G6AGAAAATA CCGCATCAGC CTCCATTCGC CATTCAGACT 

10501 CCGCAACTGT TGGGAAGGGC GGTCAGTACG CGCTTCTTCG CTATTACGCC 

10551 AACTGGCGAA AGGGGGATGT GCTGCAAGGC GATTAAGTTG GGTAACGCCA 

10601 GGGTTTTCCC AATCACGACG TTGTAAAACG ACAGCCAATG AATTGAAGCT 

10651 TATTAATTCT AGACTGAAGC TTTTAATACG ACTCACTATA (SEQ. ED. NO.:3) 
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SEQUENCE LISTIIIG 

<110> Istituto Di Ricerche Di Biologia Holecolare P. Angeletti S.P.A. 

<120> HEPATITIS C VIRUS REPLICONS AND REPLICON 
ENHANCED CELLS 

<130> IT0003 PCT 

<150> 60/263,479 
<151> 2001-01-23 

<160> 13 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 3010 
<212> PRT 

<213> Con 1 HCV isolate nucleic acid 



<400> 1 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 

15 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 

20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 

35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 

50 55 . 60 

He Pro Lys Ala Arg Gin Pro Glu Gly Arg Ala Trp Ala Gin Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 

85 . 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 

100 105 110 

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys 

115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala Pro Leu 

130 135 140 

Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 

165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr He Pro Ala Ser Ala Tyr 

180 185 190 

Glu Val Arg Asn Val Ser Gly Val Tyr His Val Thr Asn Asp Cys Ser 

195 200 205 

Asn Ala Ser He Val Tyr Glu Ala Ala Asp Met He Met His Thr Pro 

210 215 220 

Gly Cys Val Pro Cys Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val 
225 230 235 240 

Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr 

245 250 255 

Thir He Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Leu Cys 

260 265 270 

Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ala 

275 280 285 

Gin Leu Phe Thr Phe Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys 

290 295 300 

Asn Cys Ser He Tyr Pro Gly His Val Thr Gly His Arg Met Ala Trp 
305 310 315 320 
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Asp Met Met Met Asn Trp Ser Pro Thr Ala Ala Leu Val Val Ser Gin 

325 330 335 

Leu Leu Arg He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His 

340 345 350 

Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Ash Trp 

355 360 365 

Ala Lys Val Leu He Val Met Leu Leu Phe Ala Gly Val Asp Gly Gly 

370 375 380 

thr Tyr Val Thr Gly Gly Thr Met Ala Lys Asn Thr Leu Gly He Thr 
385 390 395 400 

Ser Leu Phe Ser Pro Gly Ser Ser Gin Lys He Gin Leu Val Asn Thr 

405 410 415 

Asn Gly Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 

420 425 430 

Leu Asn Thr Gly Phe Leu Ala Ala Leu Phe Tyr Val His Lys Phe Asn 

435 440 445 

Ser Ser Gly Cys Pro Glu Arg Met Ala Ser Cys Ser Pro He Asp Ala 

450 455 460 

Phe Ala Gin Gly Trp Gly Pro He Thr Tyr Asn Glu Ser His Ser Ser 
465 470 475 480 

Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He 

485 490 495 

Val Pro Ala Ala Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser 

500 505 510 

Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Ser 

515 520 525 

Trp Gly Glu Asn Glu Thr T^p Val Leu Leu Leu Asn Asn Thr Arg Pro 

530 535 540 

Pro Gin Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe 
545 550 555 560 

Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He Gly Gly He Gly Asn 

565 570 575 

Lys Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala 

580 585 590 

Thr Tyr Thr Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro T^g Cys Leu 

595 600 605 

Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe 

610 615 620 

Thr He Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Leu 
625 630 635 640 

Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asn Leu Glu Asp 

645 650 655 

Arg Asp Arg Ser Glu Leu Ser Pro Leu l.eu Leu Ser Thr Thr Glu Trp 

660 665 670 

Gin Val Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly 

675 680 685 

Leu He His Leu His Gin Asn Val Val Asp Val Gin Tyr Leu Tyr Gly 

690 695 700 

He Gly Ser Ala Val Val Ser Phe Ala He Lys Trp Glu Tyr Val Leu 
705 710 715 720 

Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp 

725 730 735 

Met Met Leu Leu He Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu Val 

740 745 750 

Val Leu Asn Ala Ala Ser Val Ala Gly Ala His Gly He Leu Ser Phe 

755 760 765 

Leu Val Phe Phe Cys Ala Ala Trp Tyr He Lys Gly Arg Leu Val Pro 

770 775 780 

Gly Ala Ala Tyr Ala Leu Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu 
785 790 795 800 

Leu Ala Leu Pro Pro Arg Ala Tyr Ala Met Asp Arg Glu Met Ala Ala 
805 . 810 815 
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Ser Cys Gly Gly Ala Val Phe Val Gly Leu lie Leu Leu Thr Leu Ser 

820 825 830 

Pro His Tyr Lys Leu Phe Leu Ala Arg Leu He Tip Trp Leu Gin Tyr 

835 840 845 

Phe He Thr Arg Ala Glu Ala His Leu Gin Val Trp He Pro Pro Leu 

850 855 860 

Asn Val Arg Gly Gly Arg Asp Ala Val He Leu Leu Thr Cys Ala He 
865 870 875 880 

His Pro Glu Leu He Phe Thr He Thr Lys He Leu Leu Ala He Leu 

885 890 895 

Gly Pro Leu Met Val Leu Gin Ala Gly He Thr Lys Val Pro Tyr Phe 

900 905 910 

Val Arg Ala His Gly Leu He Arg Ala Cys Met Leu Val Arg Lys Val 

915 920 925 

Ala Gly Gly His Tyr Val Gin Met Ala Leu Met Lys Leu Ala Ala Leu 

930 935 940 

Thr Gly Thr Tyx Val Tyr Asp His Leu Thr Pro Leu Arg Asp Trp Ala 
945 950 955 960 

His Ala Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro Val Val Phe 

965 970 975 

Ser Asp Met Glu Thr Lys Val He Thr Trp Gly Ala Asp Thr Ala Ala 

980 985 990 

Cys Gly Asp He He Leu Gly Leu Pro Val Ser Ala Arg Arg Gly Arg 

995 1000 1005 

Glu He His Leu Gly Pro Ala Asp Ser Leu Glu Gly Gin Gly Trp Arg 

1010 1015 1020 

Leu Leu Ala Pro He Thr Ala Tyr Ser Gin Gin Thr Arg Gly Leu Leu 
1025 1030 1035 1040 

Gly Cys He He Thr Ser Leu Thr Gly Arg Asp Arg Asn Gin Val Glu 

1045 1050 1055 

Gly Glu Val Gin Val Val Ser Thr Ala Thr Gin Ser Phe Leu Ala Thr 

1060 1065 1070 

Cys Val Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Ser Lys 

1075 1080 1085 

Thr Leu Ala Gly Pro Lys Gly Pro He Thr Gin Met Tyr Thr Asn Val 

1090 1095 1100 

Asp Gin Asp Leu Val Gly Trp Gin Ala Pro Pro Gly Ala Arg Ser Leu 
1105 1110 1115 1120 

Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His 

1125 1130 1135 

Ala Asp Val He Pro Val Arg Arg Arg Gly Asp Ser Arg Gly Ser Leu 

1140 1145 1150 

Leu Ser Pro Arg Pro Val Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro 

1155 1160 1165 

Leu Leu Cys Pro Ser Gly His Ala Val Gly He Phe Arg Ala Ala Val 

1170 1175 1180 

Cys Thr Arg Gly Val Ala Lys Ala Val Asp Phe Val Pro Val Glu Ser 
1185 1190 1195 1200 

Met Glu Thr Thr Met Arg Ser Pro Val Phe Thr Asp Asn Ser Ser Pro 

1205 1210 1215 

Pro Ala Val Pro Gin Thr Phe Gin Val Ala His Leu His Ala Pro Thr 

1220 1225 1230 

Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr Ala Ala Gin Gly 

1235 1240 1245 

Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly Phe 

1250 1255 1260 

Gly Ala Tyr Met Ser Lys Ala His Gly He Asp Pro Asn He Arg Thr 
1265 1270 1275 1280 

Gly Val Arg Thr He Thr Thr Gly Ala Pro He Thr Tyr Ser Thr Tyr 

1285 1290 1295 

Gly Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly Ala Tyr Asp He 
1300 1305 1310 
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He He Cys Asp Glu Cys His Ser Thr Asp Ser Thr Thr He Leu Gly 

1315 1320 1325 

He Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly Ala Arg Leu Val 

1330 1335 1340 

val Leu Ala thr Ala Thr Pro Pro Gly Ser Val Thr Val Pro His Pro 
1345 1350 1355 1360 

Asn He Glu Glu Val Ala Leu Ser Ser Thr Gly Glu He Pro Phe Tyr 

1365 1370 1375 

Gly Lys Ala He Pro He Glu Thr He Lys Gly Gly Arg His Leu He 

1380 1385 1390 

Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Lys Leu Ser 

1395 1400 1405 

Gly Leu Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser 

1410 1415 1420 

Val He Pro Thr Ser Gly Asp Val He Val Val Ala Thr Asp Ala Leu 
1425 1430 1435 1440 

Met Thr Gly Phe Thr Gly Asp Phe Asp Ser Val He Asp Cys Asn Thr 

1445 1450 1455 

Cys Val Thr Gin Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr He 

1460 1465 1470 

Glu Thr Thr Thr Val Pro Gin Asp Ala Val Ser Arg Ser Gin Arg Arg 

1475 1480 1485 

Gly Arg Thr Gly Arg Gly Arg Met Gly He Tyr Arg Phe Val Thr Pro 

1490 1495 1500 

Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val Leu Cys Glu Cys 
1505 1510 1515 1520 

Tyr Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro Ala Glu Thr Ser 

1525 1530 1535 

Val Arg Leu Arg Ala Tyr Leu Asn Thr Pro Gly Leu Pro Val Cys Gin 

1540 1545 1550 

Asp His Leu Glu Phe Trp Glu Ser Val Phe Thr Gly Leu Thr His He 

1555 1560 1565 

Asp Ala His Phe Leu Ser Gin Thr Lys Gin Ala Gly Asp Asn Phe Pro 

1570 1575 1580 

Tyr Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Gin Ala Pro 
1585 1590 1595 1600 

Pro Pro Ser Trp Asp Gin Met Trp Lys Cys Leu He Arg Leu Lys Pro 

1605 1610 1615 

Thr Leu His Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Ala Val Gin 

1620 1625 1630 

Asn Glu Val Thr Thr Thr His Pro He Thr Lys Tyr He Met Ala Cys 

1635 1640 1645 

Met Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp Val Leu Val Gly 

1650 1655 1660 

Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Thr Thr Gly Ser Val 
1665 1670 1675 1680 

Val He Val Gly Arg He He Leu Ser Gly Lys Pro Ala He He Pro 

1685 1690 1695 

Asp Arg Glu Val Leu Tyr Arg Glu Phe Asp Glu Met Glu Glu Cys Ala 

1700 1705 1710 

Ser His Leu Pro Tyr He Glu Gin Gly Met Gin Leu Ala Glu Gin Phe 

1715 1720 1725 

Lys Gin Lys Ala He Gly Leu Leu Gin Thr Ala Thr Lys Gin Ala Glu 

1730 1735 1740 

Ala Ala Ala Pro Val Val Glu Ser Lys Trp Arg Thr Leu Glu Ala Phe 
1745 1750 1755 1760 

Trp Ala Lys His Met Trp Asn Phe He Ser Gly He Gin Tyr Leu Ala 

1765 1770 1775 

Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala He Ala Ser Leu Met Ala 

1780 1785 1790 

Phe Thr Ala Ser He Thr Ser Pro Leu Thr Thr Gin His Thr Leu Leu 
1795 1800 1805 
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Phe Asn lie Leu Gly Gly Trp Val Ala Ala Gin Leu Ala Pro Pro Ser 

1810 181S 1820 

Ala Ala Ser Ala Phe Val Gly Ala Gly He Ala Gly Ala Ala Val Gly 
1825 1830 1835 1840 

Ser He Gly Leu Gly Lys Val Leu Val Asp He Leu Ala Gly Tyr Gly 

1845 1850 1855 

Ala Gly Val Ala Gly Ala Leu Val Ala Phe Lys Val Met Ser Gly Glu 

1860 1865 1870 

Met Pro Ser Thr Glii Asp Leu Val Asn Leu Leu Pro Ala He Leu Ser 

1875 1880 1885 

Pro Gly Ala Leu Val Val Gly Val Val Cys Ala Ala He Leu Arg Arg 

1890 1895 . 1900 

His Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met Asn Arg Leu He 
1905 1910 1915 1920 

Ala Phe Ala Ser Arg Gly Asn His Val Ser Pro Thr His Tyr Val Pro 

1925 1930 1935 

Glu Ser Asp Ala Ala Ala Arg Val Thr Gin He Leu Ser Ser Leu Thr 

1940 1945 1950 

He Thr Gin Leu Leu Lys Arg Leu His Gin Trp He Asn Glu Asp Cys 

1955 1960 1965 

Ser Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp He 

1970 1975 1980 

Cys Thr Val Leu Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu 
1985 1990 1995 2000 

Pro Arg Leu Pro Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys 

2005 2010 2015 

Gly Val Trp Arg Gly Asp Gly He Met Gin Thr Thr Cys Pro Cys Gly 

2020 2025 2030 

Ala Gin He Thr Gly His Val Lys Asn Gly Ser Met T^g He Val Gly 

2035 2040 2045 

Pro Arg Thr Cys Ser Asn Thr Trp His Gly Thr Phe Pro He Asn Ala 

2050 2055 2060 

Tyr Thr Thr Gly Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg 
2065 2070 2075 2080 

Ala Leu Trp Arg Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val 

2085 2090 2095 

Gly Asp Phe His Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys 

2100 2105 2110 

Pro Cys Gin Val Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val 

2115 2120 2125 

Arg Leu His Arg Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu 

2130 2135 2140 

Val Thr Phe Leu Val Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu 
2145 2150 2155 2160 

Pro Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr 

2165 2170 2175 

Asp Pro Ser His He Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg 

2180 2185 2190 

Gly Ser Pro Pro Ser Leu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala 

2195 2200 2205 

Pro Ser Leu Lys Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala 

2210 2215 2220 

Asp Leu He Glu Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn 
2225 2230 2235 2240 

He Thr Arg Val Glu Ser Glu Asn Lys Val Val He Leu Asp Ser Phe 

2245 2250 2255 

Glu Pro Leu Gin Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala 

2260 2265 2270 

Glu He Leu Arg Arg Ser Arg Lys Phe Pro Arg Ala Met Pro He Trp 

2275 2280 2285 

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro 
2290 2295 2300 
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Asp Tyr Val Pro Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys 
2305 2310 2315 2320 

Ala Pro Pro lie Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser 

2325 2330 2335 

Glu Ser Thr Val Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe 

2340 2345 2350 

Gly Ser Ser Glu Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser 

2355 2360 2365 

Pro Asp Gin Pro Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser 

2370 2375 2380 

Tyr Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu 
2385 2390 2395 2400 

Ser Asp Gly Ser Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val 

2405 2410 2415 

Val Cys Cys Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu lie Thr Pro 

2420 2425 2430 

Cys Ala Ala Glu Glu Thr Lys Leu Pro lie Asn Ala Leu Ser Asn Ser 

2435 2440 2445 

Leu Leu Arg His His Asn Leu Val Tyr Ala Thr Thr Ser Arg Ser Ala 

2450 2455 2460 

Ser Leu Arg Gin Lys Lys Val Thr Phe Asp Arg Leu Gin Val Leu Asp 
2465 2470 2475 2480 

Asp His Tyr Arg Asp Val Leu Lys Glu Met Lys Ala Lys Ala Ser Thr 

2485 2490 2495 

Val Lys Ala Lys Leu Leu Ser Val Glu Glu Ala Cys Lys Leu Thr Pro 

2500 2505 2510 

Pro His Ser Ala Arg Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg 

2515 2520 2525 

Asn Leu Ser Ser Lys Ala Val Asn His He Arg Ser Val Trp Lys Asp 

2530 2535 2540 

Leu Leu Glu Asp Thr Glu Thr Pro He Asp Thr Thr He Met Ala Lys 
2545 2550 2555 2560 

Asn Glu Val Phe Cys Val Gin Pro Glu Lys Gly Gly Arg Lys Pro Ala 

2565 2570 2575 

Arg Leu lie Val Phe' Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met 

2580 2585 2590 

Ala Leu Tyr Asp Val Val Ser Thr Leu Pro Gin Ala Val Met Gly Ser 

2595 2600 2605 

Ser Tyr Gly Phe Gin Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu Val 

2610 2615 2620 

Asn Ala Trp Lys Ala Lys Lys Cys Pro Met Gly Phe Ala Tyr Asp Thr 
2625 2630 2635 2640 

Arg Cys Phe Asp Ser Thr Val Thr Glu Asn Asp lie Arg Val Glu Glu 

2645 2650 2655 

Ser He Tyr Gin Cys Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala He 

2660 2665 2670 

Arg Ser Leu Thr Glu Arg Leu Tyr He Gly Gly Pro Leu Thr Asn Ser 

2675 2680 2685 

Lys Gly Gin Asn Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu 

2690 2695 2700 

Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ala Ala 
2705 2710 2715 2720 

Ala Cys Arg Ala Ala Lys Leu Gin Asp Cys Thr Met Leu Val Cys Gly 

2725 2730 2735 

Asp Asp Leu Val Val He Cys Glu Ser Ala Gly Thr Gin Glu Asp Glu 

2740 2745 2750 

Ala Ser Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro 

2755 2760 2765 

Pro Gly Asp Pro Pro Lys Pro Glu Tyr Asp Leu Glu Leu He Thr Ser 

2770 2775 2780 

Cys Ser Ser Asn Val Ser Val Ala His Asp Ala Ser Gly Lys Arg Val 
2785 2790 2795 2800 
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Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp 

2805 2810 2815 

Glu Thr Ala Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn lie He 

2820 2825 2830 

Met Tyr Ala Pro Thr Leu Trp Ala Arg Met He Leu Met Thr His Phe 

2835 2840 2845 

Phe Ser He Leu Leu Ala Gin Glu Gin Leu Glu Lys Ala Leu Asp Cys 

2850 2855 2860 

Gin He Tyr Gly Ala Cys Tyr Ser He Glu Pro Leu Asp Leu Pro Gin 
2865 2870 2875 2880 

He He Gin Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr 

2885 2890 2895 

Ser Pro Gly Glu He Asn Arg Val Ala Ser Cys Leu Arg Lys Leu Gly 

2900 2905 2910 

Val Pro Pro Leii Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala 

2915 2920 2925 

Arg Leu Leu Ser Gin Gly Gly Arg Ala Ala Thr Cys Gly Lys Tyr Leu 

2930 2935 2940 

Phe Asn Trp Ala Val Arg Thr Lys I^eu Lys Leu Thr Pro He Pro Ala 
2945 2950 2955 2960 

Ala Ser Gin Leu Asp Leu Ser Ser Trp Phe Val Ala Gly Tyr Ser Gly 

2965 2970 2975 

Gly Asp He Tyr His Ser Leu Ser Arg Ala Arg Pro Arg Trp Phe Met 

2980 2985 2990 

Trp Cys Leu Leu Leu Leu Ser Val Gly Val Gly He Tyr Leu Leu Pro 
2995 3000 3005 

Asn Arg 
3010 

<210> 2 
<211> 9605 
<212> DNA 

<213> Con 1 HCV isolate amino acid 
<400> 2 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 

tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 

cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 

gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 

gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 

gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 

ctcaaagaaa aaccaaacgt aacaccaacc gccgcccaca ggacgtcaag ttcccgggcg 420 

gtggtcagat cgtcggtgga gtttacctgt tgccgcgcag gggccccagg ttgggtgtgc 480 

gcgcgactag gaagacttcc gagcggtcgc aacctcgtgg aaggcgacaa cctatcccca 540 

aggctcgcca gcccgagggt agggcctggg ctcagcccgg gtacccctgg cccctctatg 600 

gcaatgaggg cttggggtgg gcaggatggc tcctgtcacc ccgtggctct cggcctagtt 660 

ggggccccac ggacccccgg cgtaggtcgc gcaatttggg taaggtcatc gataccctca 720 

cgtgcggctt cgccgatctc atggggtaca ttccgctcgt cggcgccccc ctagggggcg 780 

ctgccagggc cctggcgcat ggcgtccggg ttctggagga cggcgtgaac tatgcaacag 840 

ggaatctgcc cggttgctcc ttttctatct tccttttggc tttgctgtcc tgtttgacca 900 

tcccagcttc cgcttatgaa gtgcgcaacg tatccggagt gtaccatgtc acgaacgact 960 

. gctccaacgc aagcattgtg tatgaggcag cggacatgat catgcatacc cccgggtgcg 1020 

tgccctgcgt tcgggagaac aactcctccc gctgctgggt agcgctcact cccacgctcg 1080 

cggccaggaa cgctagcgtc cccactacga cgatacgacg ccatgtcgat ttgctcgttg 1140 

gggcggctgc tctctgctcc gctatgtacg tgggagatct ctgcggatct gttttcctcg 1200 

tcgcccagct gttcaccttc tcgcctcgcc ggcacgagac agtacaggac tgcaattgct 1260 

caatatatcc cggccacgtg acaggtcacc gtatggcttg ggatatgatg atgaactggt 1320 

cacctacagc agccctagtg gtatcgcagt tactccggat cccacaagct gtcgtggata 1380 

tggtggcggg ggcccattgg ggagtcctag cgggccttgc ctactattcc atggtgggga 1440 

actgggctaa ggttctgatt gtgatgctac tctttgccgg cgttgacggg ggaacctatg 1500 

tgacaggggg gacgatggcc aaaaacaccc tcgggattac gtccctcttt tcacccgggt 1560 

catcccagaa aatccagctt gtaaacacca acggcagctg gcacatcaac aggactgccc 1620 
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tgaactgcaa tgactccctc aacactgggt tccttgctgc gctgttctac gtgcacaagt 1680 

tcaactcatc tggatgccca gagcgcatgg ccagctgcag ccccatcgac gcgttcgctc 1740 

s^rgggtgggg gcccatcact tacaatgagt cacacagctc ggaccagagg ccttattgtt 1800 

ggcactacgc accccggccg tgcggtatcg tacccgcggc gcaggtgtgt ggtccagtgt 1860 

actgcttcac cccaagccct gtcgtggtgg ggacgaccga ccggttcggc gtccctacgt 1920 

acagttgggg ggagaatgag acggacgtgc .tgcttcttaa caacacgcgg ccgccgcaag 1980 

gcaactggtt tggctgtaca tggatgaata gcactgggtt caccaagacg tgcgggggcc 2040 

ccccgtgtaa catcgggggg atcggcaata aaaccttgac ctgccccacg gactgcttcc .2100 

ggaagcaccc cgaggccact tacaccaagt gtggttcggg gccttggttg acacccagat 2160 

gcttggtcca ctacccatac aggctttggc actacccctg cactgtcaac tttaccatct 2220 

tcaaggttag gatgtacgtg gggggagtgg agcacaggct cgaagccgca tgcaattgga 2280 

ctcgaggaga gcgttgtaac ctggaggaca gggacagatc agagcttagc ccgctgctgc 2340 

tgtctacaac ggagtggcag gtattgccct gttccttcac caccctaccg gctctgtcca 2400 

ctggtttgat ccatctccat cagaacgtcg tggacgtaca atacctgtac ggtatagggt 2460 

cggcggttgt ctcctttgca atcaaatggg agtatgtcct gttgctcttc cttcttctgg 2520 

cggacgcgcg cgtctgtgcc tgcttgtgga tgatgctgct gatagctcaa gctgaggccg 2580 

ccctagagaa cctggtggtc ctcaacgcgg catccgtggc cggggcgcat ggcattctct 2640 

ccttcctcgt gttcttctgt gctgcctggt acatcaaggg caggctggtc cctggggcgg 2700 

catatgccct ctacggcgta tggccgctac tcctgctcct gctggcgtta ccaccacgag 2760 

catacgccat ggaccgggag atggcagcat cgtgcggagg cgcggttttc gtaggtctga 2820 

tactcttgac cttgtcaccg cactataagc tgttcctcgc taggctcata tggtggttac 2880 

aatattttat caccagggcc gaggcacact tgcaagtgtg gatccccccc ctcaacgttc 2940 

gggggggccg cgatgccgtc atcctcctca cgtgcgcgat ccacccagag ctaatcttta 3000 

ccatcaccaa aatcttgctc gccatactcg gtccactcat ggtgctccag gctggtataa 3060 

ccaaagtgcc gtacttcgtg cgcgcacacg ggctcattcg tgcatgcatg ctggtgcgga 3120 

aggttgctgg gggtcattat gtccaaatgg ctctcatgaa gttggccgca ctgacaggta 3180 

cgtacgttta tgaccatctc accccactgc gggactgggc ccacgcgggc ctacgagacc 3240 

ttgcggtggc agttgagccc gtcgtcttct ctgatatgga gaccaaggtt atcacctggg 3300 

gggcagacac cgcggcgtgt ggggacatca tcttgggcct gcccgtctcc gcccgcaggg 3360 

ggagggagat acatctggga ccggcagaca gccttgaagg gcaggggtgg cgactcctcg 3420 

cgcctattac ggcctactcc caacagacgc gaggcctact tggctgcatc atcactagcc 3480 

tcacaggccg ggacaggaac caggtcgagg gggaggtcca agtggtctcc accgcaacac 3540 

aatctttcct ggcgacctgc gtcaatggcg tgtgttggac tgtctatcac ggtgccggct 3600 

caaagaccct tgccggccca aagggcccaa tcacccaaat gtacaccaat gtggaccagg 3660 

acctcgtcgg ctggcaagcg ccccccgggg cgcgttcctt gacaccatgc acctgcggca 3720 

gctcggacct ttacttggtc acgaggcatg ccgatgtcat tccggtgcgc cggcggggcg 3780 

acagcagggg gagcctactc tcccccaggc ccgtctccta cttgaagggc tcttcgggcg 3840 

gtccactgct ctgcccctcg gggcacgctg tgggcatctt tcgggctgcc gtgtgcaccc 3900 

gaggggttgc gaaggcggtg gactttgtac ccgtcgagtc tatggaaacc actatgcggt 3960 

ccccggtctt cacggacaac tcgtcccctc cggccgtacc gcagacattc caggtggccc 4020 

atctacacgc ccctactggt agcggcaaga gcactaaggt gccggctgcg tatgcagccc 4080 

aagggtataa ggtgcttgtc ctgaacccgt ccgtcgccgc caccctaggt ttcggggcgt 4140 

atatgtctaa ggcacatggt atcgacccta acatcagaac cggggtaagg accatcacca 4200 

cgggtgcccc catcacgtac tccacctatg gcaagtttct tgccgacggt ggttgctctg 4260 

ggggcgccta tgacatcata atatgtgatg agtgccactc aactgactcg accactatcc 4320 

tgggcatcgg cacagtcctg gaccaagcgg agacggctgg agcgcgactc gtcgtgctcg 4380 

ccaccgctac gcctccggga tcggtcaccg tgccacatcc aaacatcgag gaggtggctc 4440 

tgtccagcac tggagaaatc cccttctatg gcaaagccat ccccatcgag accatcaagg 4500 

g9g99aggca cctcattttc tgccattcca agaagaaatg tgatgagctc gccgcgaagc 4560 

tgtccggcct cggactcaat gctgtagcat attaccgggg ccttgatgta tccgtcatac 4620 

caactagcgg agacgtcatb gtcgtagcaa cggacgctct aatgacgggc tttaccggcg 4680 

atttcgactc agtgatcgac tgcaatacat gtgtcaccca gacagtcgac ttcagcctgg 4740 

acccgacctt caccattgag acgacgaccg tgccacaaga cgcggtgtca cgctcgcagc 4800 

ggcgaggcag gactggtagg ggcaggatgg gcatttacag gtttgtgact ccaggagaac 4860 

ggccctcggg catgttcgat tcctcggttc tgtgcgagtg ctatgacgcg ggctgtgctt 4920 

ggCacgagct cacgcccgcc gagacctcag ttaggttgcg ggcttaccta aacacaccag 4980 

ggttgcccgt ctgccaggac catctggagt tctgggagag cgtctttaca ggcctcaccc 5040 

acatagacgc ccatttcttg tcccagacta agcaggcagg agacaacttc ccctacctgg 5100 

tagcatacca ggctacggtg tgcgccaggg ctcaggctcc acctccatcg tgggaccaaa 5160 

tgtggaagtg tctcatacgg ctaaagccta cgctgcacgg gccaacgccc ctgctgtata 5220 

ggctgggagc cgttcaaaac gaggttacta ccacacaccc cataaccaaa tacatcatgg 5280 

catgcatgtc ggctgacctg gaggtcgtca cgagcacctg ggtgctggta ggcggagtcc 5340 
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tagcagcCct ggccgcgtat tgcctgacaa caggcagcgt ggtcattgtg ggcaggatca 5400 

tcttgtccgg aaagccggcc atcattcccg acagggaagt cctttaccgg gagttcgatg 5460 

agatggaaga gtgcgcctca cacctccctt acatcgaaca gggaatgcag ctcgccgaac 5520 

aattcaaaca gaaggcaatc gggttgctgc aaacagccac caagcaagcg gaggctgctg 5580 

ctcccgtggt ggaatccaag tggcggaccc tcgaagcctt ctgggcgaag catatgtgga 5640 

atttcatcag cgggatacaa tatttagcag gcttgtccac tctgcctggc aaccccgcga 5700 

tagcatcact gatggcattc acagcctcta tcaccagccc gctcaccacc caacataccc 5760 

tcctgtttaa catcctgggg ggatgggtgg ccgcccaact tgctcctccc agcgctgctt 5820 

ctgctttcgt aggcgccggc atcgctggag cggctgttgg cagcataggc cttgggaagg 5880 

tgcttgtgga tattttggca ggttatggag caggggtggc aggcgcgctc gtggccttta 5940 

aggtcatgag cggcgagatg ccctccaccg aggacctggt taacctactc cctgctatcc 6000 

tctcccctgg cgccctagtc gtcggggtcg tgtgcgcagc gatactgcgt cggcacgtgg 6060 

gcccagggga gggggctgtg cagtggatga accggctgat agcgttcgct tcgcggggta 6120 

accacgtctc ccccacgcac tatgtgcctg agagcgacgc tgcagcacgt gtcactcaga 6180 

tcctctctag tcttaccatc actcagctgc tgaagaggct tcaccagtgg atcaacgagg 6240 

actgctccac gccatgctcc ggctcgtggc taagagatgt ttgggattgg atatgcacgg 6300 

tgttgactga tttcaagacc tggctccagt ccaagctcct gccgcgattg ccgggagtcc 6360 

ccttcttctc atgtcaacgt gggtacaagg gagtctggcg gggcgacggc atcatgcaaa 6420 

ccacctgccc atgtggagca cagatcaccg gacatgtgaa aaacggttcc atgaggatcg 6480 

tggggcctag gacctgtagt aacacgtggc atggaacatt ccccattaac gcgtacacca 6540 

cgggcccctg cacgccctcc ccggcgccaa attattctag ggcgctgtgg cgggtggctg 6600 

ctgaggagta cgtggaggtt acgcgggtgg gggatttcca ctacgtgacg ggcatgacca 6660 

ctgacaacgt aaagtgcccg tgtcaggttc cggcccccga attcttcaca gaagtggatg 6720 

gggtgcggtt gcacaggtac gctccagcgt gcaaacccct cctacgggag gaggtcacat 6780 

tcctggtcgg gctcaatcaa tacctggttg ggtcacagct cccatgcgag cccgaaccgg 6840 

acgtagcagt gctcacttcc atgctcaccg acccctccca cattacggcg gagacggcta 6900 

agcgtaggct ggccagggga tctcccccct ccttggccag ctcatcagct agccagctgt 6960 

ctgcgccttc cttgaaggca acatgcacta cccgtcatga ctccccggac gctgacctca 7020 

tcgaggccaa cctcctgtgg cggcaggaga tgggcgggaa catcacccgc gtggagtcag 7080 

aaaataaggt agtaattttg gactctttcg agccgctcca agcggaggag gatgagaggg 7140 

aagtatccgt tccggcggag atcctgcgga ggtccaggaa attccctcga gcgatgccca 7200 

tatgggcacg cccggattac aaccctccac tgttagagtc ctggaaggac ccggactacg 7260 

tccctccagt ggtacacggg tgtccattgc cgcctgccaa ggcccctccg ataccacctc 7320 

cacggaggaa gaggacggtt gtcctgtcag aatctaccgt gtcttctgcc ttggcggagc 7380 

tcgccacaaa gaccttcggc agctccgaat cgtcggccgt cgacagcggc acggcaacgg 7440 

cctctcctga ccagccctcc gacgacggcg acgcgggatc cgacgttgag tcgtactcct 7500 

ccatgccccc ccttgagggg gagccggggg atcccgatct cagcgacggg tcttggtcta 7560 

ccgtaagcga ggaggctagt gaggacgtcg tctgctgctc gatgtcctac acatggacag 7620. 

gcgccctgat cacgccatgc gctgcggagg aaaccaagct gcccatcaat gcactgagca 7680 

actctttgct ccgtcaccac aacttggtct atgctacaac atctcgcagc gcaagcctgc 7740 

ggcagaagaa ggtcaccttt gacagactgc aggtcctgga cgaccactac cgggacgtgc 7800 

tcaaggagat gaaggcgaag gcgtccacag ttaaggctaa acttctatcc gtggaggaag 7860 

cctgtaagct gacgccccca cattcggcca gatctaaatt tggctatggg gcaaaggacg 7920 

tccggaacct atccagcaag gccgttaacc acatccgctc cgtgtggaag gacttgctgg 7980 

aagacactga gacaccaatt gacaccacca tcatggcaaa aaatgaggtt ttctgcgtcc 8040. 

aaccagagaa ggggggccgc aagccagctc gccttatcgt attcccagat ttgggggttc 8100 

9tgtgtgcga gaaaatggcc ctttacgatg tggtctccac cctccctcag gccgtgatgg 8160 

gctcttcata cggatcccaa tactctcctg gacagcgggt cgagttcctg gtgaatgcct 8220 

ggaaagcgaa gaaatgccct atgggcttcg catatgacac ccgctgtttt gactcaacgg 8280 

tcactgagaa tgacatccgt gttgaggagt caatctacca atgttgtgac ttggcccccg 8340 

aagccagaca ggccataagg tcgctcacag agcggcttta catcgggggc cccctgacta 8400 

attctaaagg gcagaactgc ggctatcgcc ggtgccgcgc gagcggtgta ctgacgacca 8460 

gctgcggtaa taccctcaca tgttacttga aggccgctgc ggcctgtcga gctgcgaagc 8520 

tccaggactg cacgatgctc gtatgcggag acgaccttgt cgttatctgt gaaagcgcgg 8580 

ggacccaaga ggacgaggcg agcctacggg ccttcacgga ggctatgact agatactctg 8640 

ccccccctgg ggacccgccc aaaccagaat acgacttgga gttgataaca tcatgctcct 8700 

ccaatgtgtc agtcgcgcac gatgcatctg gcaaaagggt gtactatctc acccgtgacc 8760 

ccaccacccc ccttgcgcgg gctgcgtggg agacagctag acacactcca gtcaattcct 8820 

ggctaggcaa catcatcatg tatgcgccca ccttgtgggc aaggatgatc ctgatgactc 8880 

atttcttctc catccttcta gctcaggaac aacttgaaaa agccctagat tgtcagatct 8940 

acggggcctg ttactccatt gagccacttg acctacctca gatcattcaa cgactccatg 9000 

gccttagcgc attttcactc catagttact ctccaggtga gatcaatagg gtggcttcat 9060 
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gcctcaggaa acttggggta ccgcccttgc gagtctggag acatcgggcc agaagtgtcc 9120 

gcgctaggct actgtcccag ggggggaggg ctgccacttg tggcaagtac ctcttcaact 9180 

gggcagtaag gaccaagctc aaactcactc caatcccggc tgcgtcccag ttggatttat 9240 

ccagctggtt cgttgctggt tacagcgggg gagacatata tcacagcctg tctcgtgccc 9300 

gaccccgctg gttcatgtgg tgcctactcc tactttctgt aggggtaggc atctatctac 9360 

Cccccaaccg atgaacgggg agctaaacac tccaggccaa taggccatcc tgtttttttc 9420 

cctttttttt tttctttttt tttttttttt tttttttttt ttttttttct cctttttttt 9480 

tcctcttttt ttccttttct ttcctttggt ggctccatct tagccctagt cacggctagc 9540 

tgtgaaaggt ccgtgagccg cttgactgca gagagtgctg atactggcct ctctgcagat 9600 

caagt 9605 



<210> 3 
<211> 10690 
<212> DNA 

<213> pHCVNeo.17 coding 



<400> 3 

gccagccccc 

tcttcacgca 

cccccctccc 

gacgaccggg 

gcgagactgc 

gtgcttgcga 

ctcaaagaaa 

cggccgcttg 

ctgatgccgc 

acctgtccgg 

cgacgggcgt 

tgctattggg 

aagtatccat 

cattcgacca 

ttgtcgatca 

ccaggctcaa 

gcttgccgaa 

tgggtgtggc 

ttggcggcga 

agcgcatcgc 

gtttccctct 

cgaagccgct 

ccgtcttttg 

aggggtcttt 

gttcctctgg 

aaccccccac 

gcaaaggcgg 

tggctctcct 

atgggatctg 

aacgtctagg 

atggcgccta 

agccCcacag 

acacaatctt 

ggctcaaaga 

caggacctcg 

ggcagctcgg 

ggcgacagca 

ggcggtccac 

acccgagggg 

cggtccccgg 

gcccatctac 

gcccaagggt 

gcgtatatgt 



gattgggggc 
gaaagcgtct 
gggagagcca 
tcctttcttg 
tagccgagta 
gtgccccggg 
aaccaaaggg 
ggtggagagg 
cgtgttccgg 
tgccctgaat 
tccttgcgca 
cgaagtgccg 
catggctgat 
ccaagcgaaa 
ggatgatctg 
ggcgcgcatg 
tatcatggtg 
ggaccgctat 
atgggctgac 
cttctatcgc 
agcgggatca 
tggaataagg 
gcaatgtgag 
cccctctcgc 
aagcttcttg 
ctggcgacag 
cacaacccca 
caagcgtatt 
atctggggcc 
ccccccgaac 
ttacggccta 
gccgggacag 
tcctggcgac 
cccttgccgg 
tcggctggca 
acctctactt 

tgctctgccc 
ttgcgaaggc 
tcttcacgga 
acgcccctac 
ataaggtgct 
ctaaggcaca 



gacactccac 

agccatggcg 

tagtggtctg 

gatcaacccg 

gtgttgggtc 

aggtctcgta 

cgcgccatga 

ctattcggct 

ctgtcagcgc 

gaactgcagg 

gctgtgctcg 

gggcaggatc 

gcaatgcggc 

catcgcatcg 

gacgaagagc 

cccgacggcg 

gaaaatggcc 

caggacatag 

cgcttcctcg 

cttcttgacg 

attccgcccc 

ccggtgtgcg 

ggcccggaaa 

caaaggaatg 

aagacaaaca 

gtgcctctgc 

gtgccacgtt 

caacaagggg 

tcggtgcaca 

cacggggacg 

ctcccaacag 

gaaccaggtc 

ctgcgtcaat 

cccaaagggc 

agcgcccccc 

ggtcacgagg 

actctccccc 

ctcggggcac 

ggtggacttt 

caactcgtcc 

tggtagcggc 

tgtcctgaac 

tggtatcgac 



catagatcac 

ttagtatgag 

cggaaccggt 

ctcaatgcct 

gcgaaaggcc 

gaccgtgcac 

ttgaacaaga 

atgactgggc 

aggggcgccc 

acgaggcagc 

acgttgtcac 

tcctgtcatc 

ggctgcatac 

agcgagcacg 

atcaggggct 

aggatctcgt 

gcttttctgg 

cgttggctac 

tgctttacgg 

agttcttctg 

tctccctccc 

tttgtctata 

cctggccctg 

caaggtctgt 

acgtctgtag 

ggccaaaagc 

gtgagctgga 

ctgaaggatg 

tgctttacat 

tggttttcct 

acgcgaggcc 

gagggggagg 

ggcgtgtgct 

ccaatcaccc 

ggggcgcgtt 

catgccgatg 
aggcccgtct 
gctgtgggca 
gtacccgtcg 
cctccggccg 
aagagcacta 
ccgtccgtcg 
cctaacatca 



tcccctgtga 

tgtcgtgcag 

gagtacaccg 

ggagatttgg 

ttgtggtact 

catgagcacg 

tggattgcac 

acaacagaca 

ggttcttttt 

gcggctatcg 

tgaagcggga 

tcaccttgct 

gcttgatccg 

tactcggatg 

cgcgccagcc 

cgtgacccat 

attcatcgac 

ccgtgatatt 

tatcgccgct 

agtttaaaca 

ccccccctaa 

tgttattttc 

tcttcttgac 

tgaatgtcgt 

cgaccctttg 

cacgtgtata 

tagttgtgga 

cccagaaggt 

gtgtttagtc 

ttgaaaaaca 

tacttggctg 

tccaagtggt 

ggactgtcta 

aaatgtacac 

ccttgacacc 

tcattccggt 

cctacttgaa 

tctttcgggc 

agtctatgga 

taccgcagac 

aggtgccggc 

ccgccaccct 

gaaccggggc 



ggaactactg 

cctccaggac 

gaattgccag 

gcgtgccccc 

gcctgatagg 

aatcctaaac 

gcaggttctc 

atcggctgct 

gtcaagaccg 

tggctggcca 

agggactggc 

cctgccgaga 

gctacctgcc 

gaagccggtc 

gaactgttcg 

ggcgatgcct 

tgtggccggc 

gctgaagagc 

cccgattcgc 

gaccacaacg 

cgttactggc 

caccatattg 

gagcattcct 

gaaggaagca 

caggcagcgg 

agatacacct 

aagagtcaaa 

accccattgt 

gaggttaaaa 

cgataatacc 

catcatcact 

ctccaccgca 

tcatggtgcc 

caatgtggac 

atgcacctgc 

gcgccggcgg 

gggctcttcg 

tgccgtgtgc 

aaccactatg 

attccaggtg 

tgcgtatgca 

aggtttcggg 

aaggaccatc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500- 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
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accacgggtg cccccatcac gtactccacc tatggcaagt ttcttgccga cggtggttgc 2640 

tctgggggcg cctatgacat cataatatgt gatgagtgcc actcaactga ctcgaccact 2700 

atcctgggca tcggcacagt cctggaccaa gcggagacgg ctggagcgcg actcgtcgtg 2760 

ctcgccaccg ctacgcctcc gggatcggtc accgtgccac atccaaacat cgaggaggtg 2820 

gctctgtcca gcactggaga aatccccttt tatggcaaag ccatccccat cgagaccatc 2880 

aaggggggga ggcacctcat tttctgccat tccaagaaga aatgtgatga gctcgccgcg 2940 

aagctgtccg gcctcggact caatgctgta gcatattacc ggggccttga tgtatccgtc 3000 

ataccaacta gcggagacgt cattgtcgta gcaacggacg ctctaatgac gggctttacc 3060 

ggcgatttcg actcagtgat cgactgcaat acatgtgtca cccagacagt cgacttcagc 3120 

ctggacccga- ccttcaccat tgagacgacg accgtgccac aagacgcggt gtcacgctcg 3180 

cagcggcgag gcaggactgg taggggcagg atgggcattt acaggtttgt gactccagga 3240 

gaacggccct cgggcatgtt cgattcctcg gttctgtgcg agtgctatga cgcgggctgt 3300 

gcttggtacg agctcacgcc cgccgagacc tcagttaggt tgcgggctta cctaaacaca 3360 

ccagggttgc ccgtctgcca ggaccatctg gagttctggg agagcgtctt tacaggcctc 3420 

acccacatag acgcccattt cttgtcccag actaagcagg caggagacaa cctcccctac 3480 

ctggtagcat accaggctac ggtgtgcgcc agggctcagg ctccacctcc atcgtgggac 3540 

caaatgtgga agtgtctcat acggctaaag cctacgctgc acgggccaac gcccctgctg 3600 

tataggctgg gagccgttca aaacgaggtt actaccacac accccataac caaatacatc 3660 

atggcatgca tgtcggctga cctggaggtc gtcacgagca cctgggtgct ggtaggcgga 3720 

gtcctagcag ctctggccgc gtattgcctg acaacaggca gcgtggtcat tgtgggcagg 3780 

atcatcttgt ccggaaagcc ggccatcatt cccgacaggg aagtccttta ccgggagttc 3840 

gatgagatgg aagagtgcgc ctcacacctc ccttacatcg aacagggaat gcagctcgcc 3900 

gaacaattca aacagaaggc aatcgggttg ctgcaaacag ccaccaagca agcggaggct 3960 

gctgctcccg tggtggaatc caagtggcgg accctcgaag ccttctgggc gaagcatatg 4020 

tggaatttca tcagcgggat acaatattta gcaggcttgt ccactctgcc tggcaacccc 4080 

gcgatagcat cactgatggc attcacagcc tctatcacca gcccgctcac cacccaacat 4140 

accctcctgt ttaacatcct ggggggatgg gtggccgccc aacttgctcc tcccagcgct 4200 

gcttctgctt tcgtaggcgc cggcatcgct ggagcggctg ttggcagcat aggccttggg 4260 

aaggtgcttg tggatatttt ggcaggttat ggagcagggg tggcaggcgc gctcgtggcc 4320 

tttaaggtca tgagcggcga gatgccctcc accgaggacc tggttaacct actccctgct 4380 

atcctctccc ctggcgccct agtcgtcggg gtcgtgtgcg cagcgatact gcgtcggcac 4440 

gtgggcccag gggagggggc tgtgcagtgg atgaaccggc tgatagcgtt cgcttcgcgg 4500 

ggtaaccacg tctcccccac gcactatgtg cctgagagcg acgctgcagc acgtgtcact 4560 

cagatcctct ctagtcttac catcactcag ctgctgaaga ggcttcacca gtggatcaac 4620 

gaggactgct ccacgccatg ctccggctcg tggctaagag atgtttggga ttggatatgc 4680 

acggtgttga ctgatttcaa gacctggctc cagtccaagc tcctgccgcg attgccggga 4740 

gtccccttct tctcatgtca acgtgggtac aagggagtct ggcggggcga cggcatcatg 4800 

caaaccacct gcccatgtgg agcacagatc accggacatg tgaaaaacgg ttccatgagg 4860 

atcgtggggc ctaggacctg tagtaacacg tggcatggaa cattccccat taacgcgtac 4920 

accacgggcc cctgcacgcc ctccccggcg ccaaattatt ctagggcgct gtggcgggtg 4980 

gctgctgagg agtacgtgga ggttacgcgg gtgggggatt tccactacgt gacgggcatg 5040 

accactgaca acgtaaagtg cccgtgtcag gttccggccc ccgaattctt cacagaagtg 5100 

gatggggtgc ggttgcacag gtacgctcca gcgtgcaaac ccctcctacg ggaggaggtc 5160 

acattcctgg tcgggctcaa tcaatacctg gttgggtcac agctcccatg cgagcccgaa 5220 

ccggacgtag cagtgctcac ttccatgctc accgacccct cccacattac ggcggagacg 5280 

gctaagcgta ggctggccag gggatctccc ccctccttgg ccagctcatc agctagccag 5340 

ctgtctgcgc cttccttgaa ggcaacatgc actacccgtc atgactcccc ggacgctgac 5400 

ctcatcgagg ccaacctcct gtggcggcag gagatgggcg ggaacatcac ccgcgtggag 5460 

tcagaaaata aggtagtaat tttggactct ttcgagccgc tccaagcgga ggaggatgag 5520 

agggaagcat ccgttccggc ggagatcctg cggaggtcca ggaaattccc tcgagcgatg 5580 

cccatatggg cacgcccgga ttacaaccct ccactgttag agtcctggaa ggacccggac 5640 

tacgtccctc cagtggtaca cgggtgtcca ttgccgcctg ccaaggcccc tccgatacca 5700 

cctccacgga ggaagaggac ggttgtcctg tcagaatcta ccgtgtcttc tgccttggcg 5760 

gagctcgcca caaagacctt cggcagctcc gaatcgtcgg ccgtcgacag cggcacggca 5820 

acggcctctc ctgaccagcc ctccgacgac ggcgacgcgg gatccgacgt tgagtcgtac 5880 

tcctccatgc ccccccttga gggggagccg ggggatcccg atctcagcga cgggtcttgg 5940 

tctaccgtaa gcgaggaggc tagtgaggac gtcgtctgct gctcgatgtc cCacacatgg 6000 

acaggcgccc tgatcacgcc atgcgctgcg gaggaaacca agctgcccat caatgcactg 6060 

agcaactctt tgctccgtca ccacaacttg gtctatgcta caacatctcg cagcgcaagc 6120 

ctgcggcaga agaaggtcac cttCgacaga ctgcaggtcc tggacgacca ctaccgggac 6180 

gtgctcaagg agatgaaggc gaaggcgtcc acagttaagg ctaaacttct atccgtggag 6240 

gaagcctgta agctgacgcc cccacattcg gccagatcta aatttggcta tggggcaaag 6300 
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gacgtccgga acctatccag caaggccgtt aaccacatcc gctccgtgtg gaaggacttg 6360 

ctggaagaca ctgagacacc aattgacacc accatcatgg caaaaaatga ggttttctgc 6420 

gtccaaccag agaagggggg ccgcaagcca gctcgcctta tcgtattccc agatttgggg 6480 

gttcgtgtgt gcgagaaaat ggccctttac gatgtggtct ccaccctccc tcaggccgtg 6540 

atgggctctt catacggatt ccaatactct cctggacagc gggtcgagtt cctggtgaat 6600 

gcctggaaag cgaagaaatg ccctatgggc ttcgcatatg acacccgctg ttttgactca 6660 

acggtcactg agaaCgacat ccgtgttgag gagtcaatct accaatgttg tgacttggcc 6720 

cccgaagcca gacaggccat aaggtcgctc acagagcggc tttacatcgg gggccccctg 6780 

actaattcta aagggcagaa ctgcggctat cgccggtgcc gcgcgagcgg tgtactgacg 6840 

accagctgcg gtaataccct cacatgttac ttgaaggccg ctgcggcctg tcgagctgcg 6900 

aagctccagg actgcacgat gctcgtatgc ggagacgacc ttgtcgttat ctgtgaaagc 6960 

gcggggaccc aagaggacga ggcgagccta cgggccttca cggaggctat gactagatac 7020 

tctgcccccc ctggggaccc gcccaaacca gaatacgact tggagtCgat aacatcatgc 7080 

tcctccaatg tgtcagtcgc gcacgatgca tctggcaaaa gggtgtacta tctcacccgt 7140 

gaccccacca ccccccttgc gcgggctgcg tgggagacag ctagacacac tccagtcaat 7200 

tcctggctag gcaacatcat catgtatgcg cccaccttgt gggcaaggat gatcctgatg 7260 

actcatttct tctccatcct tctagctcag gaacaacttg aaaaagccct agattgtcag 7320 

aCctacgggg cctgttactc cattgagcca cttgacctac ctcagatcat tcaacgactc 7380 

catggcctta gcgcattttc actccatagt tactctccag gtgagatcaa tagggtggct 7440 

tcatgcctca ggaaacttgg ggtaccgccc ttgcgagtct ggagacatcg ggccagaagt 7500 

gtccgcgcta ggctactgtc ccaggggggg agggctgcca cttgtggcaa gtacctcttc 7560 

aactgggcag taaggaccaa gctcaaactc actccaatcc cggctgcgtc ccagttggat 7620 

ttatccagct ggttcgttgc tggttacagc gggggagaca tatatcacag cctgtctcgt 7680 

gcccgacccc gctggttcat gtggtgccta ctcctacttt ctgtaggggt aggcatctat 7740 

ctactcccca accgatgaac ggggagctaa acactccagg ccaataggcc atcctgtttt 7800 

tttccctttt tttttttctt tttttttttt tttttttttt tttttttttt ttctcctttt 7860 

tttttcctct ttttttcctt ttctttcctt tggtggctcc atcttagccc tagtcacggc 7920 

tagctgtgaa aggtccgtga gccgcttgac tgcagagagt gctgatactg gcctctctgc 7980 

agatcaagta cttctagaga attctagctt ggcgtaatca tggtcatagc tgtttcctgt 8040 

gtgaaattgt tatcagctca caattccaca caacatacga gccggaagca taaagtgtaa 8100 

agcctgggat gcctaatgag tgagctaact cacattagtt gcgttgcgct cactgcccgc 8160 

tttccagtcg ggaaacctgt cgtgccagct ccattagtga atcgtccaac gcacggggag 8220 

aggcggtttg cgtattgggc gcacttccgc ttcctcgctc actgactcgc tgcgctcgtt . 8280 

cgttcggctg cggcgagccg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 8340 

atcaggggat aacgcaggaa agaccatgtg agcaaaaggc cagcaaaagg ccaggaaccg 8400 

taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 8460 

aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 8520 

tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 8580 

gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 8640 

cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 8700 

cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 8760 

atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 8820 

tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat 8880 

ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 8940 

acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 9000 

aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 9060 

aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 9120 

tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 9180 

cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 9240 

catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 9300 

ccccagtgct gcaatgatac cgcgagaacc acgctcaccc gcaccagatt tatcagcaat 9360 

aaaccagcca gccggaagtg cgctgcggag aagtggtccC gcaactttat ccgcctccat 9420 

ccagtctatt agttgttgcc gggaagctag agtaagtagt tcgccagtca gcagtttgcg 9480 

taacgtcgtt gccatagcaa caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 9540 

attcagctcc ggctcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 9600 

agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 9660 

actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 9720 

ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 9780 

ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt 9840 

gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 9900 

atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 9960 

cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 10020 
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gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 10080 

gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 10140 

ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc taagaaacca ttattaccat 10200 

gacattaacc tataaaaata ggcgtatcac gaagcccttt cgtctagcgc gtttcggtga 10260 

tgacggtgaa aacctctgac acttgcagct cccgcagacg gtcacagctt gtctgtaagc 10320 

ggatgccggg agcaggcaag cccgtcaggg cgcgtcagtg ggtgttggcg ggtgtcgggg 10380 

ctggcttaac tatgcggcat cagagcagat tgtactgaga gtacaccaga tgcggtgtga 10440 

aataccgcac agatgcgtaa ggagaaaata ccgcatcagc ctccattcgc cattcagact 10500 

ccgcaactgt tgggaagggc ggtcagtacg cgcttcttcg ctattacgcc aactggcgaa 10560 

agggggatgt gctgcaaggc gattaagttg ggtaacgcca gggttttccc aatcacgacg 10620 

ttgtaaaacg acagccaatg aattgaagct tattaattct agactgaagc ttttaatacg 10680 

actcactata 10690 



<210> 4 
<211> 23 
<212> DNA 

<213> Primer oligonucleotide 
<400> 4 

acatgatctg cagagaggcc agt 23 

<210> 5 
<211> 26 
<212> DNA 

<213> Primer oligonucleotide 
<400> 5 

gacasgctgt gatawatgtc tccccc 26 



<210> 6 
<211> 21 
<212> DNA 

<213> Primer oligonucleotide 
<400> 6 

tggctctcct caagcgtatt c 21 

<210> 7 
<211> 23 
<212> DNA 

<213> Primer oligonucleotide 
<400> 7 

actctctgca gtcaagcggc tea 23 



<210> 8 
<211> 21 
<212> DNA 

<213> Primer oligonucleotide 
<400> 8 

cagtggatga accggctgat a 21 

<210> 9 
<211> 23 
<212> DNA 

<213> Primer oligonucleotide 
<400> 9 

ggggcgacgg catcatgcaa acc 23 
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<210> 10 
<211> 23 
<212> DNA 

<213> Primer oligonucleotide 
<400> 10 

caggacctgc agtctgtcaa agg 

<210> 11 
<211> 17 
<212> DNA 

<213> Primer oligonucleotide 
<400> 11 

cgggagagcc atagtgg 

<210> 12 
<211> 19 
<212> DNA 

<213> Primer oligonucleotide 
<400> 12 

agtaccacaa ggcctttcg 



17 



<210> 13 
<211> 21 
<212> DNA 
<213> Probe 

<400> 13 

ctgcggaacc ggtgagtaca c . 21 
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